Gene Nther_1622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1622 
Symbol 
ID6314769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1700700 
End bp1701911 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content38% 
IMG OID642643998 
Productglycine betaine/L-proline ABC transporter, ATPase subunit 
Protein accessionYP_001917784 
Protein GI188586239 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.000303127 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0000000924799 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTGACAA AAACGCAAAT CGAAGTGAAC AATGTTTACA AAATTTTTGG ACCAAGACCA 
GATCGAGTCT TTCCACTACT TGAAAAAGGT ATGACCAAAG ATGAAATTTT AAGAAAAACA
GGAAACACTG TCGGAGTAAA CGATGCAAGC TTTGATGTTA AAGAAGGCGA AATCTTTGTA
ATCATGGGTT TATCTGGTAG TGGCAAGTCA ACACTTATTA GGTGTATTAA TCGACTCATT
GAACCAACTA GAGGCGAAGT CCTGATCGGT GGAGAAGATA TTTTACAGAT GGATAATGAA
AAACTTAGAC AAGTAAGACG TGCCAAACTT GGCATGGTAT TTCAGCACTT CGCTTTGTTT
CCTCACAGAA CTGTACTTGA TAATGTTACC TACGGTTTAG AAGTTCAAAA TGTTGATCAA
GAAAAACGCC AAGAAGTAGG CATGAAGGCA TTAGATCAAG TTGGCTTAAA AGATTATGCA
AAATCAAAAC CAAGCGGCTT GAGTGGTGGA ATGCAACAGC GTGTCGGCCT TGCCAGGGCA
TTAGCATTAG ATCCAGACAT TTTACTAATG GATGAACCAT TTAGTGCCTT AGACCCGCTG
ATTAGGCGCG ATATGCAGAG TGAATTGTTA GAATTACAAT CAAGAGTTAA TAAAACAATT
TTATTCATTA CACACGACTT GGACGAGGCT TTGAAACTAG GGGACAGAAT CGCTATCATG
AGAGATGGCG TTATAGTGCA AATTGGTGAG CCAGAAGAAA TTCTTTCAAA TCCTGCTAAT
GAGTATGTAG AAAACTTTGT CAGAGATGTT AACAGACTAA AAATACTTAC TGCAGGAAGT
ATAATGGAGA AACCTGATGT AACAGTAAAT ATCAGCGACG GTCCTAGAAA AGTTCTTCGA
GTATTGGAAA AAGAAGGTTT CTCCAATGCT TATGTTGTAG ATCGTCAAAA ACGAGTTAAA
GGTGTAATTA AAGATACTGG AGCTTTAGAG GCTCTAAAAA ACAATGAAAA AACAATTGAA
AACTATCTGA TCACTGATTA TCCATCAACA TCAGAAGATA CTCCACTTAA TGAACTGTTA
CAAACAGCTT CGGAGTCAGA CTACCCTATA GCAGTCGTAG ATGAAGAAGA TAAATTGCAA
GGGCTGATTG TCAGGGTATC AGTTCTGGCC TCATTAGCTG AAGGAGAAGG AAGTGATAAC
GATGATTCCT AG
 
Protein sequence
MVTKTQIEVN NVYKIFGPRP DRVFPLLEKG MTKDEILRKT GNTVGVNDAS FDVKEGEIFV 
IMGLSGSGKS TLIRCINRLI EPTRGEVLIG GEDILQMDNE KLRQVRRAKL GMVFQHFALF
PHRTVLDNVT YGLEVQNVDQ EKRQEVGMKA LDQVGLKDYA KSKPSGLSGG MQQRVGLARA
LALDPDILLM DEPFSALDPL IRRDMQSELL ELQSRVNKTI LFITHDLDEA LKLGDRIAIM
RDGVIVQIGE PEEILSNPAN EYVENFVRDV NRLKILTAGS IMEKPDVTVN ISDGPRKVLR
VLEKEGFSNA YVVDRQKRVK GVIKDTGALE ALKNNEKTIE NYLITDYPST SEDTPLNELL
QTASESDYPI AVVDEEDKLQ GLIVRVSVLA SLAEGEGSDN DDS