Gene GM21_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2026 
Symbol 
ID8137362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2349930 
End bp2351159 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content65% 
IMG OID644869641 
Productprotein of unknown function DUF214 
Protein accessionYP_003021836 
Protein GI253700647 
COG category[V] Defense mechanisms 
COG ID[COG0577] ABC-type antimicrobial peptide transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.000000000157118 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCTCCC TCTACCAGAG TTTCCTCATC GCGGTGCGCG CGCTTAGGGT GAACAAGATG 
CGGGCGCTGT TAACCATGCT GGGGATCATC ATCGGCATCG CCGCGGTCAT CGCCATGGTC
GCCATCGGCG CGGGTGCCAG CAAGATGATC TCCGACCAGA TCTCCAGCAT CGGGTCGAAC
CTGCTCCTGG TGCTTCCCGG CTCCACCACC AGCGGGGGGT TGCGCTCCGG CGCCGGGTCC
CACCAGACGC TCACCTACGA CGACGCCATG GCCATCAAGG CGGAATGCCC GTCGGTGGGG
GCTGTGGCGC CGCAGGTGCG CGGCTCGGGG CAGGTGGTTT ACGGCAACCA GAACTGGTCC
ACCGTCGTCT ACGGCGCCAC GCCGGACGTG ATCCAGGTGC GCGACTGGAC CATCGTGGCC
GGGCGCAACA TCACCCAGTC CGACGTCGAC GGCGCCACCA AGAACTGCCT GATCGGGCAG
ACCGTCGCCG ACAACCTCTT CGGCGCGGCC GATCCCATCG GGAAGATCAT CAGGATCAAG
AAGATACCCT TCACCGTGGT AGGGCTTTTG GGCGAAAAGG GGCAGTCCCC CCAGGGGCAG
GACCAGGACG ACGTCATCTA CGTGCCGCTT CGGACGGCGC AGCGAAAGCT TCTGGGGAGT
CAGTTCCCGA ACGTGGTCGG CTCCATCATG GTGCAGGCCA AAAGCGGCGA GGTGCTGGAC
CAGGCGGAGG AGGAGGTGAC GGCGCTTTTG AACCAGAGGC ACCGCATCGG CCCCAGCCGC
GAGGTCGACT TCACCATCAG GAACCTCTCC GAACTCCTGG CGGTCACCGC CCAGTCCTCG
AAGGTGATGT CGATCCTCCT GGGGGCGGTC GCCTCCATCT CGCTGGTGGT TGGCGGGATC
GGCATCATGA ACATCATGCT CGTCTCGGTC ACCGAGAGGA CCCGCGAGAT CGGGATCAGG
ATCGCCATCG GCGCCAAGAG GCGCGACATA CTGCTGCAGT TTCTCACCGA GGCGGTGCTC
CTCACCACCT GCGGCGGCAT CATCGGCATG CTGCTAGGCG TTGCGGGGGC GCGGCTGGTC
GCCTCGCTGG TGGGGTGGCC CACGCTGGTA TCGGTGAACA CCATCGTCGT CGCCTTTGCC
TTTTCCGCAG GTGTCGGGGT CTTCTTCGGG TTCTATCCGG CCCGCAAGGC CTCCTCTTTG
AACCCAATAG AAGCGCTGAG ATACGAATAA
 
Protein sequence
MSSLYQSFLI AVRALRVNKM RALLTMLGII IGIAAVIAMV AIGAGASKMI SDQISSIGSN 
LLLVLPGSTT SGGLRSGAGS HQTLTYDDAM AIKAECPSVG AVAPQVRGSG QVVYGNQNWS
TVVYGATPDV IQVRDWTIVA GRNITQSDVD GATKNCLIGQ TVADNLFGAA DPIGKIIRIK
KIPFTVVGLL GEKGQSPQGQ DQDDVIYVPL RTAQRKLLGS QFPNVVGSIM VQAKSGEVLD
QAEEEVTALL NQRHRIGPSR EVDFTIRNLS ELLAVTAQSS KVMSILLGAV ASISLVVGGI
GIMNIMLVSV TERTREIGIR IAIGAKRRDI LLQFLTEAVL LTTCGGIIGM LLGVAGARLV
ASLVGWPTLV SVNTIVVAFA FSAGVGVFFG FYPARKASSL NPIEALRYE