Gene Nther_1043 details


Gene Information

Locus tag: Nther_1043
Symbol:
ID: 6314224
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 1106425
End bp: 1107711
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 41%
IMG OID: 642643415
Product: Sarcosine reductase
Protein accession: YP_001917215
Protein GI: 188585670
COG category:
COG ID:
TIGRFAM ID:
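
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes that the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. All variable names are made up for the example.

```python
# Values copied from the Gene Information fields above.
start_bp = 1106425
end_bp = 1107711
gene_length_bp = 1287
protein_length_aa = 428

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not contribute
# a residue, so the protein has one residue fewer than the codon count.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # span matches Gene Length
print(span % 3 == 0)                                 # whole number of codons
print(expected_protein_length == protein_length_aa)  # matches Protein Length
```

Under these assumptions all three checks agree with the record: 1107711 - 1106425 + 1 = 1287 bp, and 1287 / 3 - 1 = 428 aa.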


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.317745
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTGA CTTTGGAAAA AGTAAGAGTT GATGATTTAG TTTTTGGTGA TCACACCATG 
ATTAACGGTA CCACTTTAAT CGTTAACAAA AACGAAATCA TTAAAAAAGT TAAGGAAGAT
GATCGGATTG CTAAAGTAAA CATAGACATT GTGAAGCCAG GTGAATCTGC TCGCTTATTT
TCTGTAAGAG ATGTAATTGA ACCTAGAGTA AAAGTGGACG ATACGTGTGA ACTGTTTCCT
GGTACGATTG GCAGTGTTGA CCAGGTGGGA GAAGGGGTTA CCAAAGTGTT TAAAGGTGCA
ACGATAGTAA CCACCGGGAA AATTGTTGGA GTTAAGGAAG GTATTATAGA CATGTCTGGT
CCAGGAGCAG AGTACACACC TTTCTCTCAT ACGAATAACC TTGTACTAGA CTGCGATCCG
ATTTCTGGCC TGGAAAGTCG CCAGTACGAA GAGGCTTTAC GGCTGGCTGG CTTGAAAATC
GCCCATTACA TTGGTGAAAA ATGTCAAGAG GCGACAGCCC AGGAAAAAGT CGCCTATGAA
ACGCTACCCA TAGATCAACA GAAGCAAAAG TACCCTGAGC TACCGAAGGT GGGATACATC
TATATGCTTC AAAGCCAGGG ACTATTACAC GATACTTATT TCTATGGTGT GGACGCTAAA
GAGATACTAC CTACTTATAT TTATCCAACA GAAGTCATGG ACGGTGCTAT TGTCAACGGA
AATAGTATTA TAGCTTGCGA CAAGAACACC ACCTACCATC ATTTGAATAA TCCTATCATT
GAAGATTTAT TTGAATATCA CGGTAAAGAA ATTAATTTCT GCGGGGTCAT TATTACAAAT
GAAAACGTTA CTTTAGAGGA TAAAGAGCGT TCTTCAAATT ACACTGCTAA ATTGTCGGAG
CAATTTGGGT TTGATGGTGT CATCATTTCA AAAGAAGAGT ACGGAAACAC AGACACCGAT
TTGATTATGA ATTGCAAGAA GATTGAGGAA AAGGGGATCA AAACGGTACT TGTAACGGAC
GAGTATGCAG GCCGGGATGG TTCTTCCCAA TCCCTAGCAG ACGCTGACCC GAAAGCAGAT
GCTGTAGTGA CTACTGGAAA TGCCAACGAA ACGATCATAT TACCCCCTAT GGACAAAATT
ATTGGTAAAA TCGATTCAGA GGATCTGGAT GCTGGTAATT ATGAAGGGAA CCTCAAGCAT
GATCAGAGTA TCGAAATAGA GATACAAGCT ATTATTGGGG CAACCAATGA ATTAGGTTTC
AACAAGATGG GTGCAACCGA ATTCTAA
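
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is supplied as a string in the space-separated 10-base blocks used above. The function name `gc_content` is hypothetical, and the record does not say how the database derived its own figure.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Example input: only the first row of the gene sequence above.
# Paste the full sequence to compare against the GC content field.
first_row = "ATGAAACTGA CTTTGGAAAA AGTAAGAGTT GATGATTTAG TTTTTGGTGA TCACACCATG"
print(f"{gc_content(first_row):.1%}")
```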
 
Protein sequence
MKLTLEKVRV DDLVFGDHTM INGTTLIVNK NEIIKKVKED DRIAKVNIDI VKPGESARLF 
SVRDVIEPRV KVDDTCELFP GTIGSVDQVG EGVTKVFKGA TIVTTGKIVG VKEGIIDMSG
PGAEYTPFSH TNNLVLDCDP ISGLESRQYE EALRLAGLKI AHYIGEKCQE ATAQEKVAYE
TLPIDQQKQK YPELPKVGYI YMLQSQGLLH DTYFYGVDAK EILPTYIYPT EVMDGAIVNG
NSIIACDKNT TYHHLNNPII EDLFEYHGKE INFCGVIITN ENVTLEDKER SSNYTAKLSE
QFGFDGVIIS KEEYGNTDTD LIMNCKKIEE KGIKTVLVTD EYAGRDGSSQ SLADADPKAD
AVVTTGNANE TIILPPMDKI IGKIDSEDLD AGNYEGNLKH DQSIEIEIQA IIGATNELGF
NKMGATEF
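
The protein sequence should follow from the gene sequence under translation table 11. The sketch below is a hedged illustration, not the database's own method: it builds the codon-to-amino-acid mapping of the standard genetic code, which table 11 shares for internal codons, and translates until the first stop codon. It does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that case does not arise), and it raises `KeyError` on ambiguous bases. The names `CODON_TABLE` and `translate` are made up for the example.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order with the third base
# varying fastest. Table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence and the first two blocks of the protein
# sequence, both copied from the record above.
first_dna_row = "ATGAAACTGA CTTTGGAAAA AGTAAGAGTT GATGATTTAG TTTTTGGTGA TCACACCATG"
first_protein_blocks = "MKLTLEKVRV DDLVFGDHTM"
print(translate(first_dna_row) == first_protein_blocks.replace(" ", ""))
```

The same function can be applied to the full gene sequence and compared with the full protein sequence after removing whitespace from both.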