Gene Nther_1003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1003 
Symbol 
ID6316568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1065800 
End bp1067461 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content34% 
IMG OID642643375 
Productcholine/carnitine/betaine transporter 
Protein accessionYP_001917175 
Protein GI188585630 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.399191 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCCA ATGATAATAG AAGAAATAGA AATGAGGATC AAGAAATAGA TAGAGATAAA 
ATAGAAGAAT TATTGTTCAG TAGAAATACA AAGAAATTTG GTTTTGATTT AAATCCAGTG
GTTTCCTTAG GTGCAGGTAT AATAATTATA ATTTTTTCTG CTTTTGCTTT GATCAATCTT
GAAAGAGCTA ATATTTTAGT TAATTCAGTC AATGAAATCA TAGTGACAAA CTTTGATTGG
ATTTTTATTC TCTCGAGCAG TTTTTTTATT TTGATTTGCA TATATATAGC TTTTTCCAAA
TTAGGTAAAG TCAAAATAGG TGGATTAAAT GCCGAGCGAG AATTTAGTAA CTTTGCCTGG
TATTCCATGC TAATATCAGC AGGGATGGGT ATTGGACTGA TGTTTTGGGC AGTTGGTGAG
CCCTTAACTC ATTTTGAAGT TTTACCTCCA GTATTTGATA GCCCTTTCGA TAAACATACT
GCCATGGCCA CTACTTTTTT TCACTGGGGC TTACACCCTT GGGCAGTTTA TTCTTTAATA
GCTTTAGCTT TAGCCTTTTT CGCTTATAAT AAGCATTTGC CATTATCAAT TAGATCAGTC
TTTTATCCTT TTTTTAAAGA AAGAGTGTAC GGAACCCTAG GTGATGTTAT CGATACTTTA
GCGGTGCTTT CAACTCTGTT TGGATTAGCT ACTTCCTTGG GTTTAGGGGC TCAACAAATA
AACAGTGGTC TTGATTATTT ATTTGATATT GGTTTTAGTG TTAACATCCA AGTAGCTTTA
ATAATTGGAA TCACATTATT AGCTACCATT TCCGTATTAT CGGGTATAGA TAAAGGAGTA
AAGTTTCTGT CAAAAATGAA TATTAGATTG GCTGCAATTT TAATGGGAAT TATCTTATTA
TTGGGACCTA CTGGTTTTAT CTTAAGGCTT TTTTCCAATT CTTTGGGCTT ATATTTTAAT
AATATTATTG AATACTCTTT CTTTATTGCA GTAGAAGAAA CAGGGTGGCA AGCAAATTGG
TCGATCTTTT ATTTAGCTTG GTGGATATCC TGGTCTCCCT TTGTTGGAAT GTTTATTGCA
AGAATTTCAA AAGGTAGGAC CATCAGGGAA TTGATCTTAG GAGTAATGAT AGTACCGTCA
CTGTTATCAT TTTTATGGTT ATCTGTTTTT GGTGGTAGTG CAATTTTTAT TAACGAACAA
GTAGGAGGTT TGTATGAAGT AGTTCAGGAT GATTTACCTG TAGCCTTGTA TGAGTTGGTT
AATTTATTAA ATTTACCATT ATTAGCCGAA TTATTTAGAA TTTTATTGTT TATATTAATA
ACTTTCTTAG TGGCAGTATA TTTCATAACC TCCTCTGATT CGGGGTCACT AGTAGTTAAT
AAAATTACAT CAAGTGGTAA ACTTAATACT CCGGCAAATC AACGTGCTTT TTGGGCAATT
TTAGAGGGTT TGTTAGCAGC AGTTCTGTTA TTGATTGGTG GAGAAAAAGC TTTACTTGCT
CTACAGACAG CAGTAATTAG TACTGGACTT CCATTTGCAG TGGTATTAAC CGCCATGGCT
TTTGCCTTGA TTAAGGGAAT TGAAGATACT CGTCGAGAAC AAAAACGGAA ACGGGAGCGG
AGAAAATTTG AAAAACTTTT AAAAGCTCAC GACCAAGAAT AA
 
Protein sequence
MAANDNRRNR NEDQEIDRDK IEELLFSRNT KKFGFDLNPV VSLGAGIIII IFSAFALINL 
ERANILVNSV NEIIVTNFDW IFILSSSFFI LICIYIAFSK LGKVKIGGLN AEREFSNFAW
YSMLISAGMG IGLMFWAVGE PLTHFEVLPP VFDSPFDKHT AMATTFFHWG LHPWAVYSLI
ALALAFFAYN KHLPLSIRSV FYPFFKERVY GTLGDVIDTL AVLSTLFGLA TSLGLGAQQI
NSGLDYLFDI GFSVNIQVAL IIGITLLATI SVLSGIDKGV KFLSKMNIRL AAILMGIILL
LGPTGFILRL FSNSLGLYFN NIIEYSFFIA VEETGWQANW SIFYLAWWIS WSPFVGMFIA
RISKGRTIRE LILGVMIVPS LLSFLWLSVF GGSAIFINEQ VGGLYEVVQD DLPVALYELV
NLLNLPLLAE LFRILLFILI TFLVAVYFIT SSDSGSLVVN KITSSGKLNT PANQRAFWAI
LEGLLAAVLL LIGGEKALLA LQTAVISTGL PFAVVLTAMA FALIKGIEDT RREQKRKRER
RKFEKLLKAH DQE