Gene GM21_2038 details

Gene Information

Locus tag: GM21_2038
Symbol:
ID: 8137374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2362271
End bp: 2363410
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 62%
IMG OID: 644869653
Product: Extracellular ligand-binding receptor
Protein accession: YP_003021848
Protein GI: 253700659
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
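
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete with a single terminal stop codon. The function names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 2362271
end_bp = 2363410
gene_length_bp = 1140
protein_length_aa = 379


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming a complete, unspliced CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1140
print(expected_protein_length(gene_length_bp))  # 379
```

Both results agree with the reported Gene Length and Protein Length.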


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.000000000000518039
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAGAAAAC TCATTACCAC TGCTCTTGCT TTATTTACCT TGCTCTGCCT TTCCACCGTG 
GCCCTGGCCG CCGCTCCCAT CAGGATCGGT GGACTTTTCG CCGTGACCGG CCCAGCCTCC
TTCCTGGGGG AGCCTGAGCG CAACACCGCG CAGATGGTGG TCAACGAGAT CAACAAGGCG
GGCGGCGTAA AGGGTCGCAA GATCGAACTG ATCACCTACG ACACCGGCGG TGACGCCACC
AAAGCGGTGC AACTCGCCAA CAAGCTGATC AAGAACGACA AGGTCGTCGC CATCATCGGT
CCCAGCACCA CCGGCGACAG CATGGCTATC ATCCCCGTGG TCGAGAGAGC CCGGATACCG
CTCATCTCCT GCGCAGCCGG GAGCAAGATC ACCGAGCCGG TGAAGAAGTG GGTCTTCAAG
ACCGCCCAGA ACGACGGCCT GGCAGCCGCC AGGATCTACG AGCAGTTGAG GAAGGAGAGG
AAGACCAAAG TGGCCATCCT GACCGTCTCC GACGGATTCG GCTCCTCCGG GCGCGAGCAG
TTGAAGGCCC AGGCGAGGGT CTACGGCATC CAGATACTTT CCGACGACAC CTACGGCCCG
AAGGACACGG ACATGACCGC GCAGCTCACG AAGATCCGCG GCTCTCAGGC GCAGGCGGTT
ATCTGCTGGG GCACCAACCC CGGCCCCGCC GTGGTGGCGA GAAACGCGAA GCAGCTCGGC
CTCAGGATCC CGCTCTACAT GAGCCACGGC GTTTCCTCCA AAAAGTTCAT CCAGCTTGCC
GGGGACGCGG CCGAGGGGGT CAGACTTCCC TCCGGCAAGG TCCTGGTCGC CGACCTGCTG
CCCAAGAGCG ACAGGCAGAA GGGGTCGCTC CTTGCCTTCA TCAAGGACTA CCAGAACCAT
TACAGGGCCG AGGGAGACCA CTTCGGCGGC CATGCCTGGG ACGCGGTGAT GCTCCTGAAA
GGCACCATCG AGAGGGGAGG GGACACCCCT GTGGGGATCC GCAACGCGCT GGAGGCAACC
CGCAACTTCG CCGGCATCGG GGGCGTTTTC AACTATTCGA CCAGGGACCA CGCCGGCCTG
ACGAAAGACG CCTTCACCCT GGTTGAAGTC CGGAAAAAAG ACTGGGTGCT GGTCAAGTAA
 
Protein sequence
MRKLITTALA LFTLLCLSTV ALAAAPIRIG GLFAVTGPAS FLGEPERNTA QMVVNEINKA 
GGVKGRKIEL ITYDTGGDAT KAVQLANKLI KNDKVVAIIG PSTTGDSMAI IPVVERARIP
LISCAAGSKI TEPVKKWVFK TAQNDGLAAA RIYEQLRKER KTKVAILTVS DGFGSSGREQ
LKAQARVYGI QILSDDTYGP KDTDMTAQLT KIRGSQAQAV ICWGTNPGPA VVARNAKQLG
LRIPLYMSHG VSSKKFIQLA GDAAEGVRLP SGKVLVADLL PKSDRQKGSL LAFIKDYQNH
YRAEGDHFGG HAWDAVMLLK GTIERGGDTP VGIRNALEAT RNFAGIGGVF NYSTRDHAGL
TKDAFTLVEV RKKDWVLVK
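
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not the tool used to produce this record. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard code, and it simplifies initiation by always reading the first codon as M. That is why the GTG start codon here yields methionine. The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
# Codon table built from the conventional TCAG ordering.
# Amino acid assignments are the same for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if pos == 0:
            residue = "M"  # simplification: the initiator codon is read as Met
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGAAAAC TCATTACCAC TGCTCTTGCT"))  # MRKLITTALA
```

The output matches the first ten residues of the protein sequence listed above.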