Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_14730 |
Symbol | comA |
ID | 7760409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1454883 |
End bp | 1457018 |
Gene Length | 2136 bp |
Protein Length | 711 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643804371 |
Product | DNA internalization-related competence protein ComA |
Protein accession | YP_002798664 |
Protein GI | 226943591 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0603479 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCTGG GCGCTGCGAT CCTTCTGGCG GCGCGTCGCT ATTCGCCGGC GCTCTTTCTC TTCGGTCTCG GCTGGGCCTG CCTGTCGGCG CACTGGGCGC TGGAGGAACG GTTGCCTGTC GAACTCGATG GTCGCACCCT GTGGCTGGAG GGGCTGGTGG TCGGTCTGCC GGCGCGCATC GACGGCACGC TGCATTTCCA ACTGGAGGAG GCCTCTTCCC GGCGCGCCGA ACTGCCCGGG CGACTGCGCC TAGCGTGGCA CGCCGGGCCG GAGGTCCGCG CCGGGGAGCG CTGGCGCCTG GCGGTCAGCC TCAAGCGTCC GCGCGGCCTG GTCAACCCGC AGGGTTTCGA TTACGAGGCC TGGCTGCTGG CCCAGCGGAT CGGCGCCACC GGGACGGTGA AAGCGGGAGA GCGACTCGGA ACGCCGGAAA ACGCCGACGG TTGGCGCGAT TCCCTGCGCC AGCGCCTGCT GCAGGTCGAT GCCCATGGCC GTGAGGGCGC GCTCGCCGCG CTGGTGATGG GCGACGCGTC CGGGCTGAGC GTGGCGGACT GGAAGCTCCT GCAGGATACC GGCACCGTGC ATCTGATGGT GATCTCCGGC CAGCATGTCG GCCTGCTTGC CGGCCTGGTC TACGGGCTGG TGGTCCTGCT GGCGAGATTT GGCCTGTGGC CGGGTTTTCT GCCCTGGTTG CCCTGTGCCT GCGGCCTGGC CTTCGCCACC GCGCTCGGTT ATGGCTGGCT GGCCGGCTTC GGGGTACCAG TACAGCGGGC CTGCGCCATG CTCGCCGTGG TGCTGTTCTG GCGCCTGCGT TTCCGCCACC TGGGTCTCTG GCTACCCATC CTGCTGGCGC TGGACGGCGT ACTGCTGCTC GAGCCCCTGG CCAGCCTGCA GCCGGGGTTC TGGCTGTCGT TCGGTGCGGT GGTGATCCTC GTCCTGGCCT TCGGCGGCCG GCTGGGTGCC TGGTCGTGGC GGCAGACCCT GTGGCGAGCG CAGTGGACCA GTGCGCTGGG ACTGCTACCG TTGTTGCTGG CCTTGGGCCT GCCGATCAGT CTCAGCGGTC CGTTGGCCAA TCTGGTCGCG GTACCCTGGG TCGGTTTCGC GGTGGTCCCG CTGGCTCTGC TCGGAACCCT GCTGCTGCCC TTGCCGGCAA TGGGCGAGGG CCTTCTCTGG CTGGCCGGCG CCTTGCTGGA GACGCTGTTT CGGCTGCTCG GCGAGATCGC CGGCGCCGTA CCGGCCTGGC TGCCCCACGC GGTGCCGGTC TGGGGCTGGC TGCTGGCGCT GCTCGGGACC CTGCTGATCC TGCTGCCGGC GGGAGTGCCG CTGCGTGTCC CGGGACTGGC GCTGCTGCTG CCCCTGGCAT TTCCGCCGCA GGAGCGAATC CCGCAGGCAC GGGCCGATGT CTGGCTGCTG GATGTCGGGC AGGGCCTTGC CGTGCTTGTG CGTACCCGCG GGCACGACCT GCTCTATGAT GCTGGGCCGC GTTTCGGCGA TTTCGATCTG GGCGAGCGCG TGGTCCTGCC TTCGCTGCGC AATCTCGGCG TGGGCCGCCT GGATCGCCTG CTGCTCAGCC ATGCCGATGG CGACCACGCC GGTGGCGCCC TGGCCGTGCG GCGCGCTCTG CCGGTGGGCG AGGTCGTCGC CGGCGAGGCG CAGGCGCAAT CGGCGGCGCT CGCCGCGCAG CCTTGCGCCC GTCGCGCCTG GCAGTGGGAT GGTGTGCGTT TCGCCACCTG GCACTGGACG GCCGTGCAAG AGGGCAATCG GGCTTCCTGC GTGCTGCTGG TCGAGGCCGC CGGCGAGCGC CTGCTGCTGA CCGGCGATAT CGATGCCGCA GCCGAGCGGG CACTGCTCGA CAGCCACCCG GAGTGGCGCG CCGACTGGCT GCTGGCGCCT CACCACGGCA GCCGCAGTTC GTCTTCGCCG GCTCTGCTCA AGGCCCTGGC GCCGCGCGCG GTGCTGATCT CGCGCGGCTG GAACAACGGC TTCGGCCATC CCCATGCGCA GGTCGTGGAG CGTTACCGGA AGCTGCCGGC CGTGATTCAC GATACTGCGC GCCAGGGGGC CCTGCGGTTT CGCCTGGGCG ACTGGGGCCG GGCGCGCGGG CTGCGCGAAG AGCCCCGCTT CTGGCGGGAA AAATGA
|
Protein sequence | MALGAAILLA ARRYSPALFL FGLGWACLSA HWALEERLPV ELDGRTLWLE GLVVGLPARI DGTLHFQLEE ASSRRAELPG RLRLAWHAGP EVRAGERWRL AVSLKRPRGL VNPQGFDYEA WLLAQRIGAT GTVKAGERLG TPENADGWRD SLRQRLLQVD AHGREGALAA LVMGDASGLS VADWKLLQDT GTVHLMVISG QHVGLLAGLV YGLVVLLARF GLWPGFLPWL PCACGLAFAT ALGYGWLAGF GVPVQRACAM LAVVLFWRLR FRHLGLWLPI LLALDGVLLL EPLASLQPGF WLSFGAVVIL VLAFGGRLGA WSWRQTLWRA QWTSALGLLP LLLALGLPIS LSGPLANLVA VPWVGFAVVP LALLGTLLLP LPAMGEGLLW LAGALLETLF RLLGEIAGAV PAWLPHAVPV WGWLLALLGT LLILLPAGVP LRVPGLALLL PLAFPPQERI PQARADVWLL DVGQGLAVLV RTRGHDLLYD AGPRFGDFDL GERVVLPSLR NLGVGRLDRL LLSHADGDHA GGALAVRRAL PVGEVVAGEA QAQSAALAAQ PCARRAWQWD GVRFATWHWT AVQEGNRASC VLLVEAAGER LLLTGDIDAA AERALLDSHP EWRADWLLAP HHGSRSSSSP ALLKALAPRA VLISRGWNNG FGHPHAQVVE RYRKLPAVIH DTARQGALRF RLGDWGRARG LREEPRFWRE K
|
| |