Gene GM21_3877 details


Gene Information

Locus tag: GM21_3877
Symbol:
ID: 8139251
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4462731
End bp: 4463783
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 58%
IMG OID: 644871494
Product: TrkA-N domain protein
Protein accession: YP_003023652
Protein GI: 253702463
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0569] K+ transport systems, NAD-binding component
TIGRFAM ID:
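
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative and not part of the record.

```python
# Values copied from the record; coordinates assumed 1-based and inclusive.
start_bp = 4462731
end_bp = 4463783


def gene_length(start, end):
    """Length in bp of a feature spanning start..end, both ends included."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and encodes no residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1053 350
```

Both results agree with the Gene Length and Protein Length fields in the record.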


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 75
Fosmid unclonability p-value: 0.705897
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCCGG TCCGGCACCT GAAAATCTCG ATAGGAGTAT TGTTGCTCCT GCTGTCGTTC 
GGCACCTTCG GCTACATCGC TATCGAGGGG TGGGACACTC TCGACGCCTT GTACATGACG
GTGATCACAC TGGGCACCGT AGGCTTCAGG GAGGTTCACA ACCTGAGCTC GGCCGGCAAG
ATCTTCACCA TGCTGCTGAT ATTTTTCGGT GTCGGTGTCA TCGGTTATAT CGTGGGCAGC
CTGGCCCAGA TCATGTTCGA GGGGCAGTTT CAGCGGATCA TGGGGAGGAA GAAGGTGGAA
AAGGCAATTG CTGCGCTGGA AGGGCATTAC ATCATCTGCG GGTTCGGCCG GATAGGGTCG
TTGATCTGCA AGGAGTTCTC GGCGAAGCCT CTGCCGTTCG TGGTGGTGGA AAAGGACCCG
GCCATGGTGG ACATCATGGA GCAGGACGGA CCGGGCTACC TGGTGTTGCG CGGCGATGCG
ACCATAGACG ACGTGCTCCT GAAGGCGGGG ATCAAGAAGG CGCGCGGACT TATTTCGGTG
GTCACCTCGG ACACCGAGAA CGTCTACATA ACCCTCACCG CCCGCGGGCT TAACCCGGAT
CTCTTCATCC TGGCGCGCGC CGGAGAGGAG GGCTCCGAAA TCAAGCTGAA GCGGGCCGGC
GCCAACAAGG TCGTCTCTCC CTATCTCATC GGCGGTTCCC GCATGGCCCA GGCGATACTG
CGCCCGACGG TGGTCGACTT CATCGAGATC GCCACGGGGC ACGAGCACAT GGAGTTGCAG
ATGGAGGAAA TCCTGATTCC GCCAGGTTGC GGCTTCATCG GAGAGACGCT GGCCAGTTCG
GGATTCAGGA AAGAAACCGG GGTCATCATC GTCGGCGTCA AGAAGCAAAA CGGCAAGATG
GTGTTCAATC CGGAGTCCCA CACGAAGCTG GAGGCGCACG ACACGCTGAT CGTTTTGGGC
GAACCCGCGG CGATTCAAAA ACTGGAGCAG TTGGTCGGCT GCGATACCTG CGCCGAAGAA
CTGATCAAAA AGCACAGGAA AAGAGATGAC TAA
 
Protein sequence
MDPVRHLKIS IGVLLLLLSF GTFGYIAIEG WDTLDALYMT VITLGTVGFR EVHNLSSAGK 
IFTMLLIFFG VGVIGYIVGS LAQIMFEGQF QRIMGRKKVE KAIAALEGHY IICGFGRIGS
LICKEFSAKP LPFVVVEKDP AMVDIMEQDG PGYLVLRGDA TIDDVLLKAG IKKARGLISV
VTSDTENVYI TLTARGLNPD LFILARAGEE GSEIKLKRAG ANKVVSPYLI GGSRMAQAIL
RPTVVDFIEI ATGHEHMELQ MEEILIPPGC GFIGETLASS GFRKETGVII VGVKKQNGKM
VFNPESHTKL EAHDTLIVLG EPAAIQKLEQ LVGCDTCAEE LIKKHRKRDD
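
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how such text could be cleaned up, translated, and checked for GC content, the code below builds a codon table from the compact NCBI amino acid string, whose codon assignments are the same for translation table 11 as for the standard code. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence shown in the record.
first_line = "ATGGATCCGG TCCGGCACCT GAAAATCTCG ATAGGAGTAT TGTTGCTCCT GCTGTCGTTC"
print(translate(first_line))  # MDPVRHLKISIGVLLLLLSF
```

The printed residues match the first two blocks of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.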