Gene Nther_2053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2053 
Symbol 
ID6315571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2169997 
End bp2170941 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content40% 
IMG OID642644441 
ProductSubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001918208 
Protein GI188586663 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID[TIGR03414] choline ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000296796 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.00000000325695 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACGCTC ATGGAAGCGG AAAACGTAAA GTTTTAATAG TAGCAATGCT TATTACAGGA 
ATTATTTTTG CAGGGATTGC TACCGGATGT GAAGAAGAAA GAGATGTAGA ACTTGCTATG
GTCGAATGGA CATGCTCTAC TCAGAAGAGT CATATCAACG AAGCTGTATT AGAGACATTA
GGTTACGATG TTAACGTTAA GACTTACAAT CTCCCTGTAA TCCTTGAAGG AATGGCAGAT
GGGCAAATTG ATGCCTTTAC AGATGCATGG TTTCAAACTT GGGGAACCCC CCTTGAAAAT
GCTTTGGAAG AAGGGGATGT AGTTCATTTA GAAACTCATT TAGATGAAAC TAATTACGCG
CCAGCTGTTC CCACTTATGT ATATGAGGAA GGGGTAACCT CCCTAGAAGA TTTAGCCGAT
CACTCGGAAA AATTTGAGTA TACTTATTAT GGCTTGGAAC CAGGGAATGA CGGTAATGAG
ATTATGATCG AAGCTTTTGA AAATGATACC TACGGTCTAG GTGAATGGGA TATCATGGAA
AGTAATGAAG CTGCTATGAT CGCTGATGTT GAGCAAAAGA TAGAAAATGA AGAATGGGTA
GTCTTTAGCG GTTGGGAACC CCATTACATG AATGTAATAT TCGATATGGA ATATTTGGAT
GATCCCAAAG GAATTTGGGG TGAAGGTGAG CAAGTTGGTA CCATTGCAAG ACCTGGCTTA
GAAGATGACA ATCCACAACT AGCTCAATAT TTGAAACAAT TTGACGTAGA TGTAGACACT
GTTGACGAAT GGGTTTACGA ATACGGTTAT GAAGACCGTG ACCCAGACGA AGTTGCGGAT
GAATGGATTA GCGAGAACTT AGATAAGGTA TTAGAGTGGG TTGATGGATT AGAAACTGTC
GATGGACAAG ATGCTCAAGA AGCATTGCGT GAAGCTTACG AATAA
 
Protein sequence
MNAHGSGKRK VLIVAMLITG IIFAGIATGC EEERDVELAM VEWTCSTQKS HINEAVLETL 
GYDVNVKTYN LPVILEGMAD GQIDAFTDAW FQTWGTPLEN ALEEGDVVHL ETHLDETNYA
PAVPTYVYEE GVTSLEDLAD HSEKFEYTYY GLEPGNDGNE IMIEAFENDT YGLGEWDIME
SNEAAMIADV EQKIENEEWV VFSGWEPHYM NVIFDMEYLD DPKGIWGEGE QVGTIARPGL
EDDNPQLAQY LKQFDVDVDT VDEWVYEYGY EDRDPDEVAD EWISENLDKV LEWVDGLETV
DGQDAQEALR EAYE