Gene Nther_1838 details


Gene Information

Locus tag: Nther_1838
Symbol:
ID: 6315665
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 1913792
End bp: 1915210
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 36%
IMG OID: 642644216
Product: transcriptional regulator, ArsR family
Protein accession: YP_001917998
Protein GI: 188586453
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2309] Leucyl aminopeptidase (aminopeptidase T)
TIGRFAM ID:
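
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1913792
end_bp = 1915210

# Inclusive span on the replicon.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values agree with the 1419 bp and 472 aa reported above.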


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.240001
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.27249
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAAGACT ATCAGCTATT AAACAATCCA GAAGCAATTA AAGCATTAGG TCATCCCCTT 
CGAATGAAAA TAATTGATTT ACTAACTCAA AAAAAGGCTT CAGTTGGACA GATTTCTCAG
GAACTTGATC TGGCTCATGC TAAAGCTTTT TACCATGTAA AAGAACTCAA AAAGCTAGGT
TTGATAGAAC TAGTAGATAC CAGAATGATT CAAGGTATTC AGGAAAAATA TTATCAAGCC
GTTGCGCAAA CTTTTTTCTT AGGGCAATCC TTGGGTCAGG GTCCCTCTGA AAGCATTGAC
AATGCTAGTC AGGCAGTTCA AGGAAGTTTA AGAGAATGGC GGAGAAGGCA GATATTAAAC
GTTGATTTAG AAAATCTGGC TCAGAAGGTT ATAACAGACG TATTGGCACT TAAACCCGGG
GAAAAAGTAC TTTTTAGTGG TGAAGCGGAA GTGATGGATT TTTGTCACGC TATGACTGTG
AGTTGCCGTA AAGCAGGAGG AGAAGGAATG GTCCATAATA TTGATTTAGA AACATTTGCA
ACCATGATAT CTGAAACACC ATTGGAAATA TTGAAGGAAA CCCCACCTTT AACCGAAGCT
TTATACCGAG AGTTAGATTA CTGGGTTGTT TTTGTACCTT TAATACCAGA AGATTATTTG
AAAGAAGTAT CCTTAGAGAA AATAGAAACT TTAAAAAAAG TAGATGCTCA ATTACATTAT
AAATATTGTA CCGATTTAAA AACCGTCTTT GTCGCTTATC CTTTACCACA GCTTTCGCAT
CGTTATTTAG TAGATTATCA AGTTTTATAT GATGCCTTTT GGAAAGGCAT GAACGTCAGT
AGGCAAAGAA TAAAAAATGA AGCTAAAAAT ATTGAAGAAA TTTTAAAAAC TGGAAAAACA
TACTCAATTT GGAATGAACT AGGTACACAT TTACAATTTA AACTTAAAGC TGATTCACAA
CCAGCTTTAG ATAGCGAGTT ATTTCAGGAT AAAAAAAATG GCGGTGAAAT AACCTTACCT
GAAGGTGTTA TCTTTTCATT TCTTGATGAA GAAACAGTAT CTGGTCAGAT TGTAGTGCCG
CGTAAAGAGT TTCGGGGAAA GCTAATTTAT AATCTTAAAA TATTTATTGA GAGCGGTCAT
GTAACAGCCA TTGAAGATTC GTCTTCTTGT CCTGAAGGTT TACTACAATA CTTAAAAGAA
ATGCCTGATT TAAAAAAGGT GACCGCTTTA GGCATTGGTG TTAATCCAGA AATACAAGGT
GATGAGTTAC CAGAAAACTT ATTACTTAGA AGTCCAGGGC AATTCCAGGT TATTTTAGGA
GATAACTCCA GACTTGGAGG GACAGCCAGG GCATCCACCT GGTTGTCTAT GCCTATTGGT
AGAGTGGAAA TAGAACAAAC AGACGGATCC TTCGCATAA
 
Protein sequence
MKDYQLLNNP EAIKALGHPL RMKIIDLLTQ KKASVGQISQ ELDLAHAKAF YHVKELKKLG 
LIELVDTRMI QGIQEKYYQA VAQTFFLGQS LGQGPSESID NASQAVQGSL REWRRRQILN
VDLENLAQKV ITDVLALKPG EKVLFSGEAE VMDFCHAMTV SCRKAGGEGM VHNIDLETFA
TMISETPLEI LKETPPLTEA LYRELDYWVV FVPLIPEDYL KEVSLEKIET LKKVDAQLHY
KYCTDLKTVF VAYPLPQLSH RYLVDYQVLY DAFWKGMNVS RQRIKNEAKN IEEILKTGKT
YSIWNELGTH LQFKLKADSQ PALDSELFQD KKNGGEITLP EGVIFSFLDE ETVSGQIVVP
RKEFRGKLIY NLKIFIESGH VTAIEDSSSC PEGLLQYLKE MPDLKKVTAL GIGVNPEIQG
DELPENLLLR SPGQFQVILG DNSRLGGTAR ASTWLSMPIG RVEIEQTDGS FA