Gene Nther_2048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2048 
Symbol 
ID6315566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2163651 
End bp2164676 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content39% 
IMG OID642644436 
ProductSubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001918203 
Protein GI188586658 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0715987 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00000000000901879 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGAGTTA TCAATAGTAT TATCAAAAAA TGTGTTTTTG CTATACTTTT TGGCATGTTA 
GTTTTGGGAG TAACCGGTTG TGCAGACAGC GAAGCTCAGG ATGAGGTTGA ACTAACTTTT
GCAGATGCAG GATGGGAAAG TATTAGGGTA CATAACTATA TAGCAGGGAT TATTTTGGAA
GAAGGATATG GTGGATATAG ACAAGATATA ATGTCTGGAT CTACACCGGT AACCTTTACT
GATTTACGCG GTGGCGGAAT TGACATTTAC ATGGAAGTGT GGAAAGAAAA TATCCAAGAG
GAGTACAATG AAGCCCTAGA AAAAGGTGAA ATCCAGGTCC TGTCGATTAA TTTTGATGAC
AACTTTCAAG GATTATATGT ACCCACCTAT GTTATAGAAG GTGATGAGGA CCGGGGAATC
GATCCTATTG CGCCTAATTT AGAGTCTGTG TTTGATTTAC CTGATTATTG GGAAGAGTTT
CAGGATCCGG AAGACCCGGA TAAAGGCCGG ATAATAGGGG CTCCCTCCGA ATGGGCAGTA
GATGAAATCC TTGAAATTAA GGTAGAAACT TACGGATTAG ATGAACACTT TAATTATGTG
AGTCCTGGTT CAGAATCTAC GTTGAATGCA ACTATAATGG ATGCCTATGA AAGTGGCGAA
CCAGTTGTGG CGTATAACTG GGAACCTACA TGGATTATGG GTAAATATGA CATGACTTTA
TTAGAAGAAC CAGAATTTGA TGAGGAAAAA TATTACGAAG AAGGATATGG AACTGAAATT
CCTTCTATGG ACGTTACAGT AGCAGTTAAT TCTGATTTAG CAGAGGAACA TCCCGAGGTA
GTTGAGTTTT TGGAGAATTA TGAGACTAGT AGTGAAGTAA CAAGTGAAGC TTTGGCTTAT
ACGGAAGAAG CTGATGCAGA TGAACGAGGA GCAGCAAAAT GGTTTTTAAG AGAATATGAA
GAGATTTGGA CTGAATGGGT TAATGAAGAA GTGGCTGAAA ATGTTAGAGA ATATTTACAG
CAATAA
 
Protein sequence
MRVINSIIKK CVFAILFGML VLGVTGCADS EAQDEVELTF ADAGWESIRV HNYIAGIILE 
EGYGGYRQDI MSGSTPVTFT DLRGGGIDIY MEVWKENIQE EYNEALEKGE IQVLSINFDD
NFQGLYVPTY VIEGDEDRGI DPIAPNLESV FDLPDYWEEF QDPEDPDKGR IIGAPSEWAV
DEILEIKVET YGLDEHFNYV SPGSESTLNA TIMDAYESGE PVVAYNWEPT WIMGKYDMTL
LEEPEFDEEK YYEEGYGTEI PSMDVTVAVN SDLAEEHPEV VEFLENYETS SEVTSEALAY
TEEADADERG AAKWFLREYE EIWTEWVNEE VAENVREYLQ Q