Gene Nther_2078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_2078 
Symbol 
ID6316073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp2197651 
End bp2198850 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content39% 
IMG OID642644466 
Producttransposase IS111A/IS1328/IS1533 
Protein accessionYP_001918233 
Protein GI188586688 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.218145 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACTTTG TTGGGATTGA TTGGGCTGAT ACAAAACATG ATATCCTGGT CATGAGTGGC 
GATGGTAGAG AACTAGATAA CTTCACTATT CAACATTCTC AAGATGGATT TGAAACTTTA
GGAACTAAAC TTCTGAAACA TGACAACAAT CCTGAAAACT TCTGCTGCTT AATTGAAACC
AAACATGGAC TTTTAACTCA ATATCTTTTA GAAAATAACT TCACTGTTTA TTCTGTTAAC
CCCAAGCTAG TTGATGCTAG ACGAAAAGCT TCCGGGGCTA AAACTGACTT TATTGATGCT
AAAATACTAG CTAATATGGG CAGATCAGAG CTCCATGACT TACATAAGCT AGAGCCTGAT
TCGGAACATA TCCAAGAGCT TAAAGTACTC ACCAGAGATC AAGACGCCCT TATACAAGAA
AGTACTAGGC TAACAAATAG GCTGATTTCA ACACTGAAAG AATACTATCC TGTAGCTCTT
GAATTATTTT CTAAAATAAC TCTACCTATC TCTCTAGCTT TCTTAAGAAA ATATCCTACT
CCAAAACAGG CTCGTAAAGC TAGCAGAGAT GATATCTACA AGTTTTTGAA AAAGCAAAAT
CATCCTAACC CTTTATCTAA AGCTAATGAA ATATTCACAA AGCTTCAAAG ACGTAATTTA
GAAGGTAACA GGGCTATTTG TTCTGCCAAG TCTAAGTTTT TATTTACTAT CCTTGATCAG
CTAGAGCCTT TATTAGAGCA CATTAAAGAG TATGACAGGG AAATTGAAAA ACTTTTTAAG
TCCCACTCTG ACAGTAAACT TTTTGAAAGC TTGCCAGGTG CCGGTAAGCG TATAGCACCG
AGGCTGCTGG CAGAGTGGGG AGATGATAGA AGCCGTTATG CTGACGCCTC GGTAGTCCAG
GCCCTTGCGG GAACTTCACC AGTACTACAT CAAAGTGGCA AAATGCGTAT TGTGAAAAGG
CGGCATTCTT GTATTAAACC TTTTAGAAAC GCTTTGCATC AATTTGCTCT TCAAACTGCG
AGGTGGGTCC CCTGGGCCAG AGATTATTAC CTCAGAAAGC GAAAAGAAGG CAAACAGCAT
CATGAGGCTG CAAGGGCTCT AGCTAATATT TGGGTCAGGA TACTCTATGC TATGTGGCTG
AACAAAGAAC CCTACAATGA AAACAAATTC TTAAAAGCTA GAGAAAAACA CGCTGCTTAA
 
Protein sequence
MYFVGIDWAD TKHDILVMSG DGRELDNFTI QHSQDGFETL GTKLLKHDNN PENFCCLIET 
KHGLLTQYLL ENNFTVYSVN PKLVDARRKA SGAKTDFIDA KILANMGRSE LHDLHKLEPD
SEHIQELKVL TRDQDALIQE STRLTNRLIS TLKEYYPVAL ELFSKITLPI SLAFLRKYPT
PKQARKASRD DIYKFLKKQN HPNPLSKANE IFTKLQRRNL EGNRAICSAK SKFLFTILDQ
LEPLLEHIKE YDREIEKLFK SHSDSKLFES LPGAGKRIAP RLLAEWGDDR SRYADASVVQ
ALAGTSPVLH QSGKMRIVKR RHSCIKPFRN ALHQFALQTA RWVPWARDYY LRKRKEGKQH
HEAARALANI WVRILYAMWL NKEPYNENKF LKAREKHAA