Gene GM21_0829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0829 
Symbol 
ID8136145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp984614 
End bp985816 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content61% 
IMG OID644868443 
Productprotein of unknown function DUF214 
Protein accessionYP_003020657 
Protein GI253699468 
COG category[V] Defense mechanisms 
COG ID[COG0577] ABC-type antimicrobial peptide transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones96 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTGG CTATCCGCGA CATACGTTAC CATCAGGGGC GGTTTATCCT GACCACCGTG 
GGGCTCGGGC TGCTCATCGG CGTCGTGATA AGCATGGGGG GGATCTATCG CGGTCTCTCT
GCCGACGCTC TGGCCGTCCA GGAATCCACC AAAGCGGACA TCTGGGTGGT GCAGCAGGGG
ACCAACGGCC CCTTCGCCGA AAGCTCGCGC ATTCCGGAAG ATATCAGGTA CCGGATCAAG
GGGGTGCCAG GTGTTGCCGA AGCGTCGCCT CTCTCCTTCC AGACCATCCA GGTCGAGCGT
CAGGGGAAGC CCTTCCGGTT CTTCCTGATC GGCCATGACC TGAACGGCCT TGGTGGTCCG
CCGAACATCA TCGCCGGCCG GAACATCCGC CAGAAGCACT ACGAGATGGT GGCCGCCAAG
GCGCTGAAAA TGGAGATCGG CGAGAAGATC CGCCTGGGAC GCCACGACTA TACCGTCGTG
GGCCTCACCG GCAACGTCGT ATCTTCCGGA GGCGACCCGG CAGCGTACGT GAGCCTGGCG
GATGCCCAGG AGATCCAGTT CAAGAAAGAC GACGACGCCA TCCGCAACGA CCGTGCCCGG
ATCGATGCAA ACCTCGCCAG GATCCAGACC CTGCCTCCGG CCCAGATAAC GGGCCTGCAA
AGGAACATCG CCGGGATCAC CGAATCGACG CATACGGTCA ACACAGTCGT CGCGCGCCTG
GCGCCGGGCG CGGATCTCCA GGAGGTACAA GAGCGGATAA GCCGCTGGAA CCACTACCGC
CCCATCTCGG CGGAGGAACA GACCAGGATC CTCACCAAGG GGATGATTGA AAAGGCCCGT
ATGCAGATCG GGCTGTTTCG GGCAATCCTC CTCGTCATCT CTTCGGTCAT CATCTCGCTC
ATCATCTACA CCTCGACCAT CGACAAGATC AAGGCCATAG CAACCCTCAA GCTCATCGGC
GCGCAGAACC GGGTCATCGT CTGGATGATT CTGCAGCAGT CGCTCCTGAT GGGGCTGATC
GCCTATGGAA TCGGCTACCT CCTGATGCTG TTGACCTATG AAAAATTCCC CAGGCGCGTC
GTGCTGCAAG CCTTCGACCT GCAGGCGCTC TTTTTGATCG TCGTTGCGAT ATGTACCGTT
TCAAGTTTCG TGGGGATCCG GAAGGCGCTC AAAGTCGAGC CCGCCGTGGC CCTGGGGGGA
TAA
 
Protein sequence
MNLAIRDIRY HQGRFILTTV GLGLLIGVVI SMGGIYRGLS ADALAVQEST KADIWVVQQG 
TNGPFAESSR IPEDIRYRIK GVPGVAEASP LSFQTIQVER QGKPFRFFLI GHDLNGLGGP
PNIIAGRNIR QKHYEMVAAK ALKMEIGEKI RLGRHDYTVV GLTGNVVSSG GDPAAYVSLA
DAQEIQFKKD DDAIRNDRAR IDANLARIQT LPPAQITGLQ RNIAGITEST HTVNTVVARL
APGADLQEVQ ERISRWNHYR PISAEEQTRI LTKGMIEKAR MQIGLFRAIL LVISSVIISL
IIYTSTIDKI KAIATLKLIG AQNRVIVWMI LQQSLLMGLI AYGIGYLLML LTYEKFPRRV
VLQAFDLQAL FLIVVAICTV SSFVGIRKAL KVEPAVALGG