Gene GM21_1853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1853 
Symbol 
ID8137184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2155962 
End bp2156948 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content64% 
IMG OID644869464 
ProductTRAP transporter solute receptor, TAXI family 
Protein accessionYP_003021664 
Protein GI253700475 
COG category[R] General function prediction only 
COG ID[COG2358] TRAP-type uncharacterized transport system, periplasmic component 
TIGRFAM ID[TIGR02122] TRAP transporter solute receptor, TAXI family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value0.45766 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGGC ACCGGCGTTA CCCGCGCTTA CTGGCGCTCC TTGTGACAGT CACCGTCCTG 
TTCGTCCCCT GTTCTCTCAC CGCCTTTCAG TATCCCTCAC TCACCATCTC CTCCGGGACC
ACCACCGGTT CCTATTATGC CGCCGCCAGT GCCATAGCGA AGGTGTTCAA CCCCAGCAGC
GGCCGCAACG GCGTGAGGCT CGCCACGGTC GCCTCGCCCG GGTCGGTGGC CAACATCGAC
CAGGTCGCCG ACGGCAAGGC CGCCTTCGGC ATCGCCGAGA CGGAGCTTTT GAAGCGGGCC
ACGCTGGGGG TGCGACCCTG GGAAGGGAAG GCGCGCACCG GCCTGCGCGC GATATTTAGC
ATCTACGTCG AGAGCGTCAC CGTCGTCGCC GCGGTCGACA GCGGCATCAA GCGGGTGAGC
GACCTGAAGG GGAAGCGGCT GAATATCGGC GCGCCTGGCT CGATAGACAA CACCTATGCG
GCCGCTTTCC TGCAGATGTC CGGGCTGAAC CCTGGGCTGG TGGTCACCTC GCAGCACTCA
ACCGCGATCG CGCCCGAACT GTTGCAAAAA GGAGAGATCG ACGCCTACCT CTGCATCGTC
GGCCATCCGA ACCTGACCGT GCTGGAAGCG AGCGCAGGCA AGCGCAAGGT CACCTTGATA
TCCCTGGACA ACGCCCTGAT CCAGCAGGTG GTCGGCCACA ACCCGCTGCT GATGGCCGTC
GCCATACCCA CCAACTTCTA TCCCAGAGTC GAAGTCAGCG GCAAGGTCCC CACCATCGGC
CTGCGCGCCG TTCTCTTCAC CTCGGCCGAT CAGCCCGAGG AAATCGTGTA CGCGGTGGTG
CGGGAGGTCA TGTCCAACCT CGACCTGTTC CGCCGCCAGC ATCCCATCCT GCAGAATCTC
TCCCCGCGGG ACGCCGCAAA GGTCGGGGCC ATTGCGCTTC ACCCCGGCGC CCTCCGATAT
TTCAAAGAGG CAGGCCTCGT TCCCTGA
 
Protein sequence
MTRHRRYPRL LALLVTVTVL FVPCSLTAFQ YPSLTISSGT TTGSYYAAAS AIAKVFNPSS 
GRNGVRLATV ASPGSVANID QVADGKAAFG IAETELLKRA TLGVRPWEGK ARTGLRAIFS
IYVESVTVVA AVDSGIKRVS DLKGKRLNIG APGSIDNTYA AAFLQMSGLN PGLVVTSQHS
TAIAPELLQK GEIDAYLCIV GHPNLTVLEA SAGKRKVTLI SLDNALIQQV VGHNPLLMAV
AIPTNFYPRV EVSGKVPTIG LRAVLFTSAD QPEEIVYAVV REVMSNLDLF RRQHPILQNL
SPRDAAKVGA IALHPGALRY FKEAGLVP