Gene GM21_2283 details

Gene Information

Locus tag: GM21_2283
Symbol:
ID: 8137623
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2662568
End bp: 2663755
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 60%
IMG OID: 644869898
Product: Extracellular ligand-binding receptor
Protein accession: YP_003022090
Protein GI: 253700901
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
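
The listed coordinates and lengths can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive start and end coordinates and that the gene length includes the stop codon; the record states neither convention, and the function names are illustrative only.

```python
# Consistency check on the coordinates listed in this record.
# Assumption: start/end are 1-based and inclusive (not stated in the record).
START_BP = 2662568
END_BP = 2663755


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1188 395
```

Both values agree with the Gene Length and Protein Length fields.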


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.000000119381
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTTTTA AGAAGATCAG CGCATTGCTC TTGGTAGCGA TGCTGGCGGC CGGCGCATTC 
GGCTGCAAGA AGAAGGAAGA GGCCCCGGGT GCAGAGGCAG CGAAACCGGC CGGTGACACG
GTGAAGATCG GTTTCCTCGG CGCTCTCACC GGCGATGTGG CCATGTTCGG CAAGCCGACC
CTCGAAGGTA TGAAGATGGC TGCCGCCGAA CTCAACGCAG CCGGCGGCGT TCTCGGCAAG
CAGATCGAGA TCGTAGAGGC CGACAACCGC GGCGACAAGC AGGAAGGCGC CTCGGTCACC
CAGAAGTTCA TCTCCCGCGA CAACGTGACC GCCATCGTCG GCGATCCGAC TACCGGCATC
ACCAAGGTTG CAGCTCCCAT CGCGCAGAAA GCAGGCGTGG TGCTTCTTTC CGCCGGCGCC
ACCGGCCCGG GCGTGGTCGA AGTGGGCGAT TTCATCTTCC GCGACACCCT GCTCGACTCC
ATCGCGATTC CCGCCTGCAT CGAGTATTTC GCAAAGGATC TCGGCTTCAA GAAAGTCGCC
ATCGTAACTT CCGACAACAA CGACTACTCG GTCGGCCTTT CCCAGACCTT CCGCGATGCA
GCCGCCAAGG TCCCCTCGAT CACGATCGTA GCGGACGAGA AGGTGAAGGA CGGCGACAAG
GACTTCTCCG CTCAGATCAC CAACATCAAG GGGAAAAAGC CGGACGTCAT CCTCTTCTCC
GGTTACTACA CCGAAGGTGC TCTCATCATG AAAGAGGCCC GTAAGCAGGG TCTGAAGGCT
TCGATGTTCG GCGGCGACGG ACTCTTCTCG CCCAAGTTCA TCGAGCTCGG CGGCCCGGCT
GTCGAGGGCT CCATGTCCGC TCTGGGCTTC TCCACCGAGC AGGCTTCCCC TGCGACCGCC
AAATTCATCG AGGCGTTCAA GGCGAAGCAT AACGGCGAAC TCCCGGGCTT GTTCGACGCT
CAGGGCTACG ACGCAGTGAT GCTCTTGGCC GACGCCATGA AGCGCGCCAA CAGCGTCGAC
GCCAAGGTCT TTAAAGATGC CCTTGCCAAG ACCAAAGGCT TCGAAGGCGT TTCCGGCACC
ATCAGCATGC AGGCCAACCG CGAGCCGATC AAGAGCCCGC TCTCCCTTCT CGCTGTGAAG
GACGGCAAGT TCGTGCTCAA GGCAAAAGTC CCCGTCAAAA TGGACTAA
 
Protein sequence
MSFKKISALL LVAMLAAGAF GCKKKEEAPG AEAAKPAGDT VKIGFLGALT GDVAMFGKPT 
LEGMKMAAAE LNAAGGVLGK QIEIVEADNR GDKQEGASVT QKFISRDNVT AIVGDPTTGI
TKVAAPIAQK AGVVLLSAGA TGPGVVEVGD FIFRDTLLDS IAIPACIEYF AKDLGFKKVA
IVTSDNNDYS VGLSQTFRDA AAKVPSITIV ADEKVKDGDK DFSAQITNIK GKKPDVILFS
GYYTEGALIM KEARKQGLKA SMFGGDGLFS PKFIELGGPA VEGSMSALGF STEQASPATA
KFIEAFKAKH NGELPGLFDA QGYDAVMLLA DAMKRANSVD AKVFKDALAK TKGFEGVSGT
ISMQANREPI KSPLSLLAVK DGKFVLKAKV PVKMD
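
The relationship between the two sequences can be illustrated with a minimal translation sketch. Translation table 11 assigns amino acids to codons in the same way as the standard code (it differs in the permitted start codons), so a standard codon table is used here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only short fragments of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the usual TCAG ordering; amino-acid assignments are the
# same for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGAGTTTTA AGAAGATCAG CGCATTGCTC"))  # prints: MSFKKISALL
print(translate("CCCGTCAAAA TGGACTAA"))               # prints: PVKMD
```

The two outputs match the first ten and last five residues of the protein sequence. Applying `gc_fraction` to the full gene sequence would give the value to compare against the GC content field.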