Gene Nther_2046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2046 
Symbol 
ID6315564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2161483 
End bp2162667 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content34% 
IMG OID642644434 
Productglycine betaine/L-proline ABC transporter, ATPase subunit 
Protein accessionYP_001918201 
Protein GI188586656 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000615426 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0000000000430181 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACCACTA ATATGAGTGA AAAAATTAAA GTTAAAAATT TGTCAAAGAT TTTCGGTCCT 
AAGCCAAAGT CAGTTTTACC CTTACTTGAA AAAAATCAGT CGAAAGAGGA CATTCTTTCA
AAAACTGGAC ATACAGTTGG TGTAAATAAT GTTTCTTTTG ATGTCAAAGA AGGAGAAACC
TTTGTTATTA TGGGATTGTC TGGTTCAGGT AAGTCTACAT TAATCCGGTG TCTTAATTTA
TTAAATAAAC CTACTACCGG AGAAATTTAC GTAAACGAGG ATAATATTCT AGAATTTGAT
AAACAAAAAT TAAGGGATTT TAGACAGAAT CAATTATCTA TGGTTTTTCA ACACTTTGGT
TTATTTACTC ATAGAACAGT ACTAGAAAAT GTAGAATTTG GTTTAGAAAT TAAAGGTGCA
AGTGAACAAG ACAGAAGAGA ATTAGCGAGA AAGACTCTTG AATCTGTCGG TTTAAAAGGC
TGGGAGGATA AAATGCCCAG CGAATTGAGT GGCGGAATGC AGCAAAGGGT TGGGCTCGCC
AGGGCTCTTG CCAATGACCC GGAAGTATTG CTGATGGATG AACCTTTTAG TGCACTAGAT
CCCCTTATTA GAAGAGAAAT GCAGCAAGAA CTGATTGACT TGCAATCAAA TTTAAAAAAG
ACTATTGTAT TCATAACTCA TGATATCAAT GAAGCTTTTA AAATAGGGGA TAGGGTAGCA
GTGATGAAAG ATGGTGTTTT TGAACAAGTT GGTACACCAG AAGAAATCTT AGACAATCCT
GCAAGTGAGT ATATTAAGGA TTTTGTCAAA GACATTGATC GTTCCAAAGT TCTACAAGCT
AAAGATGTTA TGTTCAATCC TTCGGCTATT ATTAATATTA ATGAAGGTTT GAAATCAGCT
GTAAGAGAAA TGCAGACAAA CGGCATTTCA AGTGTATATG TGATTGATAA AAACAAGCAA
TTACTTGGTA TTGTCAGTAT TGATGATGCT ATAGACGCAA TTAAAGAAAA TAAATTTCTA
AGGGATGTTA TAACTGATAA TTACTATACT ACTGATCCGG AGATATACAT CCACGAATTA
ATACCTGTGG CTAAAGATAG TAAATATCCC ATTGCAGTAG TTAATGATAA CAATGAATTG
ATGGGAATTA TTGTAAGGAC ATCGGTGTTG GCTGCTTTAG TATAA
 
Protein sequence
MTTNMSEKIK VKNLSKIFGP KPKSVLPLLE KNQSKEDILS KTGHTVGVNN VSFDVKEGET 
FVIMGLSGSG KSTLIRCLNL LNKPTTGEIY VNEDNILEFD KQKLRDFRQN QLSMVFQHFG
LFTHRTVLEN VEFGLEIKGA SEQDRRELAR KTLESVGLKG WEDKMPSELS GGMQQRVGLA
RALANDPEVL LMDEPFSALD PLIRREMQQE LIDLQSNLKK TIVFITHDIN EAFKIGDRVA
VMKDGVFEQV GTPEEILDNP ASEYIKDFVK DIDRSKVLQA KDVMFNPSAI ININEGLKSA
VREMQTNGIS SVYVIDKNKQ LLGIVSIDDA IDAIKENKFL RDVITDNYYT TDPEIYIHEL
IPVAKDSKYP IAVVNDNNEL MGIIVRTSVL AALV