Gene Avin_14730 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_14730 
SymbolcomA 
ID7760409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1454883 
End bp1457018 
Gene Length2136 bp 
Protein Length711 aa 
Translation table11 
GC content71% 
IMG OID643804371 
ProductDNA internalization-related competence protein ComA 
Protein accessionYP_002798664 
Protein GI226943591 
COG category[R] General function prediction only 
COG ID[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0603479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCTGG GCGCTGCGAT CCTTCTGGCG GCGCGTCGCT ATTCGCCGGC GCTCTTTCTC 
TTCGGTCTCG GCTGGGCCTG CCTGTCGGCG CACTGGGCGC TGGAGGAACG GTTGCCTGTC
GAACTCGATG GTCGCACCCT GTGGCTGGAG GGGCTGGTGG TCGGTCTGCC GGCGCGCATC
GACGGCACGC TGCATTTCCA ACTGGAGGAG GCCTCTTCCC GGCGCGCCGA ACTGCCCGGG
CGACTGCGCC TAGCGTGGCA CGCCGGGCCG GAGGTCCGCG CCGGGGAGCG CTGGCGCCTG
GCGGTCAGCC TCAAGCGTCC GCGCGGCCTG GTCAACCCGC AGGGTTTCGA TTACGAGGCC
TGGCTGCTGG CCCAGCGGAT CGGCGCCACC GGGACGGTGA AAGCGGGAGA GCGACTCGGA
ACGCCGGAAA ACGCCGACGG TTGGCGCGAT TCCCTGCGCC AGCGCCTGCT GCAGGTCGAT
GCCCATGGCC GTGAGGGCGC GCTCGCCGCG CTGGTGATGG GCGACGCGTC CGGGCTGAGC
GTGGCGGACT GGAAGCTCCT GCAGGATACC GGCACCGTGC ATCTGATGGT GATCTCCGGC
CAGCATGTCG GCCTGCTTGC CGGCCTGGTC TACGGGCTGG TGGTCCTGCT GGCGAGATTT
GGCCTGTGGC CGGGTTTTCT GCCCTGGTTG CCCTGTGCCT GCGGCCTGGC CTTCGCCACC
GCGCTCGGTT ATGGCTGGCT GGCCGGCTTC GGGGTACCAG TACAGCGGGC CTGCGCCATG
CTCGCCGTGG TGCTGTTCTG GCGCCTGCGT TTCCGCCACC TGGGTCTCTG GCTACCCATC
CTGCTGGCGC TGGACGGCGT ACTGCTGCTC GAGCCCCTGG CCAGCCTGCA GCCGGGGTTC
TGGCTGTCGT TCGGTGCGGT GGTGATCCTC GTCCTGGCCT TCGGCGGCCG GCTGGGTGCC
TGGTCGTGGC GGCAGACCCT GTGGCGAGCG CAGTGGACCA GTGCGCTGGG ACTGCTACCG
TTGTTGCTGG CCTTGGGCCT GCCGATCAGT CTCAGCGGTC CGTTGGCCAA TCTGGTCGCG
GTACCCTGGG TCGGTTTCGC GGTGGTCCCG CTGGCTCTGC TCGGAACCCT GCTGCTGCCC
TTGCCGGCAA TGGGCGAGGG CCTTCTCTGG CTGGCCGGCG CCTTGCTGGA GACGCTGTTT
CGGCTGCTCG GCGAGATCGC CGGCGCCGTA CCGGCCTGGC TGCCCCACGC GGTGCCGGTC
TGGGGCTGGC TGCTGGCGCT GCTCGGGACC CTGCTGATCC TGCTGCCGGC GGGAGTGCCG
CTGCGTGTCC CGGGACTGGC GCTGCTGCTG CCCCTGGCAT TTCCGCCGCA GGAGCGAATC
CCGCAGGCAC GGGCCGATGT CTGGCTGCTG GATGTCGGGC AGGGCCTTGC CGTGCTTGTG
CGTACCCGCG GGCACGACCT GCTCTATGAT GCTGGGCCGC GTTTCGGCGA TTTCGATCTG
GGCGAGCGCG TGGTCCTGCC TTCGCTGCGC AATCTCGGCG TGGGCCGCCT GGATCGCCTG
CTGCTCAGCC ATGCCGATGG CGACCACGCC GGTGGCGCCC TGGCCGTGCG GCGCGCTCTG
CCGGTGGGCG AGGTCGTCGC CGGCGAGGCG CAGGCGCAAT CGGCGGCGCT CGCCGCGCAG
CCTTGCGCCC GTCGCGCCTG GCAGTGGGAT GGTGTGCGTT TCGCCACCTG GCACTGGACG
GCCGTGCAAG AGGGCAATCG GGCTTCCTGC GTGCTGCTGG TCGAGGCCGC CGGCGAGCGC
CTGCTGCTGA CCGGCGATAT CGATGCCGCA GCCGAGCGGG CACTGCTCGA CAGCCACCCG
GAGTGGCGCG CCGACTGGCT GCTGGCGCCT CACCACGGCA GCCGCAGTTC GTCTTCGCCG
GCTCTGCTCA AGGCCCTGGC GCCGCGCGCG GTGCTGATCT CGCGCGGCTG GAACAACGGC
TTCGGCCATC CCCATGCGCA GGTCGTGGAG CGTTACCGGA AGCTGCCGGC CGTGATTCAC
GATACTGCGC GCCAGGGGGC CCTGCGGTTT CGCCTGGGCG ACTGGGGCCG GGCGCGCGGG
CTGCGCGAAG AGCCCCGCTT CTGGCGGGAA AAATGA
 
Protein sequence
MALGAAILLA ARRYSPALFL FGLGWACLSA HWALEERLPV ELDGRTLWLE GLVVGLPARI 
DGTLHFQLEE ASSRRAELPG RLRLAWHAGP EVRAGERWRL AVSLKRPRGL VNPQGFDYEA
WLLAQRIGAT GTVKAGERLG TPENADGWRD SLRQRLLQVD AHGREGALAA LVMGDASGLS
VADWKLLQDT GTVHLMVISG QHVGLLAGLV YGLVVLLARF GLWPGFLPWL PCACGLAFAT
ALGYGWLAGF GVPVQRACAM LAVVLFWRLR FRHLGLWLPI LLALDGVLLL EPLASLQPGF
WLSFGAVVIL VLAFGGRLGA WSWRQTLWRA QWTSALGLLP LLLALGLPIS LSGPLANLVA
VPWVGFAVVP LALLGTLLLP LPAMGEGLLW LAGALLETLF RLLGEIAGAV PAWLPHAVPV
WGWLLALLGT LLILLPAGVP LRVPGLALLL PLAFPPQERI PQARADVWLL DVGQGLAVLV
RTRGHDLLYD AGPRFGDFDL GERVVLPSLR NLGVGRLDRL LLSHADGDHA GGALAVRRAL
PVGEVVAGEA QAQSAALAAQ PCARRAWQWD GVRFATWHWT AVQEGNRASC VLLVEAAGER
LLLTGDIDAA AERALLDSHP EWRADWLLAP HHGSRSSSSP ALLKALAPRA VLISRGWNNG
FGHPHAQVVE RYRKLPAVIH DTARQGALRF RLGDWGRARG LREEPRFWRE K