Gene GM21_2380 details


Gene Information

Locus tag: GM21_2380
Symbol:
ID: 8137721
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2777367
End bp: 2779115
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 45%
IMG OID: 644869995
Product: CHAP domain containing protein
Protein accession: YP_003022186
Protein GI: 253700997
COG category: [R] General function prediction only
COG ID: [COG3942] Surface antigen
TIGRFAM ID:
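
The start, end, gene length and protein length fields above are related by simple arithmetic. The following minimal Python sketch checks that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2777367
END_BP = 2779115


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Residue count for a CDS, assuming one trailing stop codon by default."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1749
print(protein_length(length_bp))  # 582
```

Under these assumptions the computed values agree with the reported Gene Length (1749 bp) and Protein Length (582 aa).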


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 88
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCGT TCATACTATG GTTAACTCTC ATGGTTGCAG TCCCTTCACT GGCGCTCGCA 
ACAAAAATTA AAGATATTGA TATATCTCCA AAGCATCCTA ATGTCGGCGA TAATGTCAAA
ATCTCTGTGA AAACAAGTGG TGATGTTGCC AGGGCGGTTA TTGAATTCCC AGATTTGGGA
ATCCAGAAAG ATCTTGGGGT CATTGGGAAG AACACTTGGG GGATATTGAT TCCATTGAAC
TATTCGGGCG AAAAAAAATT CAAAGTTGTT TTGTATCAGA AAAAAAAGGA CAAAAATCCT
AAGGACACAG AAAAAGACAA AATTATCGTG TCCGCAAAAA ATGAGGACCG AAAGCCCAAG
CCAAGTGCTG AACCCAAGCC TGGAGCTGAA CCCAAGCCTG GCCCTGAACC CGACAAGCAA
AGACCGGTAG TTGCAGGTTT TAATATAAAT AAGGAGTCTT CGAAGGTAGG AGAAGTAATC
ACTATATCTT ATGTGGTGAG CGACTCAGGT GGTTCTGGGC TAAAGCAGGT TGAACTGTGG
CGTTCCCCGA ATAATAGTGA TTGGGCAGAG ATCAAAGAGA AGAGACGAAA CCTTTCAGGC
GGAAGCGCCT CAGGTTCATT TACTGATACA CCCTCGGCAC CTGGTACCTA CTATTATGGG
ATTCACGTAG TAGACAATAA TAATAACTGG AGTGGCGAGA GTGGTCCGAA GCGATTAATG
GTTGAACAGG CGGCCAGGCA GCAGCATGGT TCAGTGTCTG GGAGAATTCA TAAGAACTTC
GCTTCAGGCC CGGTGCTTTC CGGGGTCTCT GTGAATTGCG GAGGAAAGAG CAGTATAACG
GACGGAAACG GATATTTCAA GATCGATGGC ATAACGGCGG GCAACCATGG GATATCATAC
TCAAAGTCCG GTTATGACGG ATTTAATAGT CAGATTGCTG TCAAGGCTGG GTCGAATATT
AGCGAAGGTG AACGGTGGTT GACTGAAAAA AGCCCTGCCA CCCAGTTGCC GAATGTGACC
GGCGTCAACG TCAACCCCGC CAGCATTACG GCAGGTCAGG CAGCATTATT CACGGCGACT
ACCGATAGTG CAGCTTCAAT GGTAACTCTG CGCTTCACCG ATGCTGGCAT CGATGTGAAG
ATGTCCGGCA ATGGCACGAC TTGGAAGGCT TATCCGCAAA TCCACAATGC CGGCAATAGA
CCTTTCACAG TTACTGCATT TGACAAAAAT AATAAGTATG GCAGCGGAAG AAAAGGCACG
ATCCAAGTTA ACAAGGTGAA GGAGCCTGTT GCCTCCAATT TGCGTCCTGA TTTCACTTTG
CCAGCTTATC GAGAGAATAA TCCTTTTTGG AATAGCGGCT ATGCCCCGAA AGAAGTTGCT
CCCCCCAAAC CTAAACTCGA TAATGCCAAA GGAAATTGTA CATGGTATGC AAATGGCCGC
CTACGTGAAC TTGGTTATAA TGTGCCAAAT AATAGTTTTA CACATCATGC AAAAACATGG
GTTAGCGATG CGAAAAATAA TCATTTTGTC GTAGACCAAA CACCTCAAGT TGGTTCAATT
GCGCAATCAG ATACTATGAA CAAAAATTAT GGACATGTTG CTGTAGTAGA GGTCGTGCAT
AATGATGGAA CTATAGTGAT TTCTGAATCG AGTTACGCGC CAGGCACTAA AGATTGGGAT
TTTTTGTATA ACACTAGGAC TATCCATAAA TCCATATTTA CGAGCTATAT TCATGTGCCT
CGAAAATAG
 
Protein sequence
MKAFILWLTL MVAVPSLALA TKIKDIDISP KHPNVGDNVK ISVKTSGDVA RAVIEFPDLG 
IQKDLGVIGK NTWGILIPLN YSGEKKFKVV LYQKKKDKNP KDTEKDKIIV SAKNEDRKPK
PSAEPKPGAE PKPGPEPDKQ RPVVAGFNIN KESSKVGEVI TISYVVSDSG GSGLKQVELW
RSPNNSDWAE IKEKRRNLSG GSASGSFTDT PSAPGTYYYG IHVVDNNNNW SGESGPKRLM
VEQAARQQHG SVSGRIHKNF ASGPVLSGVS VNCGGKSSIT DGNGYFKIDG ITAGNHGISY
SKSGYDGFNS QIAVKAGSNI SEGERWLTEK SPATQLPNVT GVNVNPASIT AGQAALFTAT
TDSAASMVTL RFTDAGIDVK MSGNGTTWKA YPQIHNAGNR PFTVTAFDKN NKYGSGRKGT
IQVNKVKEPV ASNLRPDFTL PAYRENNPFW NSGYAPKEVA PPKPKLDNAK GNCTWYANGR
LRELGYNVPN NSFTHHAKTW VSDAKNNHFV VDQTPQVGSI AQSDTMNKNY GHVAVVEVVH
NDGTIVISES SYAPGTKDWD FLYNTRTIHK SIFTSYIHVP RK
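
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following Python sketch shows one way to strip that layout, compute a GC fraction, and spot-check the start of the translation. The helper names are hypothetical, and the codon table is deliberately partial: it holds only the seven codons needed for the first line of the gene sequence, not the full translation table 11.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# Partial codon table (assumption: only the codons needed for this check).
PARTIAL_CODON_TABLE = {
    "ATG": "M", "AAA": "K", "GCG": "A", "TTC": "F",
    "ATA": "I", "CTA": "L", "TGG": "W",
}


def translate_prefix(seq, table):
    """Translate codon by codon, stopping at the first codon not in the table."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if codon not in table:
            break
        residues.append(table[codon])
    return "".join(residues)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAAGCGT TCATACTATG GTTAACTCTC ATGGTTGCAG TCCCTTCACT GGCGCTCGCA"
dna = clean_sequence(first_line)
print(len(dna))                                    # 60
print(translate_prefix(dna, PARTIAL_CODON_TABLE))  # MKAFILW
```

The translated prefix matches the first seven residues of the protein sequence (MKAFILW). Passing the full cleaned gene sequence to gc_fraction would give a value to compare against the reported GC content.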