Gene GM21_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2037 
Symbol 
ID8137373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2361087 
End bp2362226 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content62% 
IMG OID644869652 
ProductExtracellular ligand-binding receptor 
Protein accessionYP_003021847 
Protein GI253700658 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.000000000000596452 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGAAAGA TTCTGGTGGT ACTCGGCATC ACGATGCTTC TGTGCAGCGC CAGCGCCGCA 
CTCGCCGCGG CGCCGATCAA GATCGGCGGC CTGTTCGCCG TTACCGGTCC CGCAGCTTTC
CTCGGCGAAC CGGAGAAGAA AACGCTGGAG CTGCTGGTCA AAGAGGCGAA CGCCAAGGGC
GGCATCAACG GCTCGAAGAT CGAGCTGGTC GTCTACGACA CAGGCGGCGA CGTCACCAAG
GCCGTTCAGC TCGCCAACAA GCTGATCAAA AACGACAAGG TCTCCGCCAT CGTCGGCCCC
AGCACCACCG GCGAAACCAT GGCCGTCATC CCCATCGCCG AGAAGGAGCA GGTCCCGCTC
ATCTCCTGCG CCGCGGGGAT CAAGATCACC GCCCCGGTGA AGAAATGGGT CTTCAAGACC
CCGGCCAACG ACCACGTCGC CGCAGAAAAG ATCCTGCTCC AGGCTGCCAG ACTAAAGCAA
AAGAACATCG CCATTCTCAC CGTTTCCGAT TCCTTCGGCT CTTCCGGGCG CGAGCAGTTG
AAGCAGATGG CGGCGAAGCA CGGCTTCAAG GTCGTAGCCG ACGAGGTCTA CGGTCCCAAA
GACACCGACA TGACCCCGCA GCTCACCAAG ATCAAGGCCG CCAAGCCTGA CGCCATCATC
TGCTGGGGGA CCAACCCCGG CCCGGCCATC ATCACCCGCA ACGTGCGGCA GTTGGGCATC
AAGGCCCAGC TCTATCAAAG CCACGGCGTC GCCTCCAAGA AGTACATCGA ACTCGCCAGC
GCCCAGGCCG CTCAGGGGGT CATGCTCCCG GCCGGAAAGC TCGCCGTCTT CGACCTCTTG
AAGAAAACCG ACCCGCAGGC GAAGCTCCTC AAGGATTACA ACGACTCCTA CAGGAAGGCC
TACGGCGTCG AGGCGTCCAC CTTCGGCGGC TACGCCTACG ACGGCTTCCT GCTCGTCGCC
CAAGCGGTGA AAAAAGGCGC CTTCACTCGG GCGCAGATCC GCGACGGCAT CGAGAAGGGG
GGGAGCATGG TCGGGGTGTC CGGCATCTTC AAGATGACCC CCAAGGACCA TAACGGTCTC
GACCTCTCCG CCTTCGAGAT GGTCCGCATC GACAAAGGCG ACTGGGTGAT CGTACGCTGA
 
Protein sequence
MRKILVVLGI TMLLCSASAA LAAAPIKIGG LFAVTGPAAF LGEPEKKTLE LLVKEANAKG 
GINGSKIELV VYDTGGDVTK AVQLANKLIK NDKVSAIVGP STTGETMAVI PIAEKEQVPL
ISCAAGIKIT APVKKWVFKT PANDHVAAEK ILLQAARLKQ KNIAILTVSD SFGSSGREQL
KQMAAKHGFK VVADEVYGPK DTDMTPQLTK IKAAKPDAII CWGTNPGPAI ITRNVRQLGI
KAQLYQSHGV ASKKYIELAS AQAAQGVMLP AGKLAVFDLL KKTDPQAKLL KDYNDSYRKA
YGVEASTFGG YAYDGFLLVA QAVKKGAFTR AQIRDGIEKG GSMVGVSGIF KMTPKDHNGL
DLSAFEMVRI DKGDWVIVR