Gene Nther_1872 details


Gene Information

Locus tag: Nther_1872
Symbol:
ID: 6315043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 1959517
End bp: 1960587
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 31%
IMG OID: 642644254
Product: putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation)
Protein accession: YP_001918032
Protein GI: 188586487
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation)
TIGRFAM ID:
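
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the stop codon; the function names gene_length and protein_length are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
START_BP, END_BP = 1959517, 1960587

length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1071 356"
```

Under these assumptions, 1071 bp corresponds to 357 codons, of which the last is the stop codon, giving the 356 aa reported in the record.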


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000000802014
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.278856
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATTGGCA AAATGAGTTT ATGGGTATTA ATTCTTAATT TTTTATTAAC TATGAATCTT 
GGGGGTTATA ATATAAACCA TATCGATTGT AACCTAATAC ACGTTTTTGA AGAAATGAAA
AAAGAGTATC AAACTAAGGA AGAAGTCGTA AACAACGATG AAGTAAAATC TATAATTATA
AGTGTTGCTG GTGATACAAC TCTTGGTTAT GATGAAGATT TTGGTTATTA CAATAGTTTT
GATCATGAAT TTGAAAGACA AGGCAAAAAT TATAATTATT TTTTTAGCAA TGTAAAAGAA
ATTTTTAAAG ACAGCGATAT TTCAATTTTA AATTTAGAAG GTACCTTAAC AAATCATGAT
CAACCTAAGA ACAAAAAGTT TACTTTTAAA GGTAAACCAG AGTACGCCAA AATTTTGAAA
AAAGGGCATA TTGATGCAGT AAACCTGGCA AATAACCACA CTATGGATTT TGGAAACAGA
GGGTTTCAAG ACACTAAAAA ATCTTTGGAA CAAAAAGGAA TTGGCTACTT CGGCTATGAC
CTGGAATTTA CCAAAGAAGT GAAAGGTAAA AAGTTTTCTA TATTAGGATT CACCGGATGG
TACGTTAATC AGGAGCGAAA AAATTATTTG AGTTCAAGAA TAGAACAGGC AAAAGCAAAC
TCTGATGCAG TAATTGTCAC TTTTCACTGG GGAAATGAGT ATGAATATGT CCCTAATGAT
ACTCAAAAAG AATTAGGAAG ATCGGCCATA GAAAGCGGTG CAGATATGGT ATGGGGGCAT
CATCCTCATG TGCTTCAAGG GATAGAACAA TACGAAGAAC GCTATATAGC TTATAGTTTA
GGAAACTTCT GTTTTGGTGG TAATAAAAAT CCTTCAGATA AAGATAGTAT GATATTCCAA
AACGAATTCA AATTTAAAAA TGGTAAAATT GAAGAAGTAG ACCACAATAT TATTCCCATA
AGTATATCTT CTAAAAAGGA GCGAAATAAT TATCAACCTA CTCCAGTCCA GAATAAAGAA
AAAGAAAGAA TCAATGAGAG AATAAAAGAA TTAAATAAAA AAATAGATTA A
 
Protein sequence
MIGKMSLWVL ILNFLLTMNL GGYNINHIDC NLIHVFEEMK KEYQTKEEVV NNDEVKSIII 
SVAGDTTLGY DEDFGYYNSF DHEFERQGKN YNYFFSNVKE IFKDSDISIL NLEGTLTNHD
QPKNKKFTFK GKPEYAKILK KGHIDAVNLA NNHTMDFGNR GFQDTKKSLE QKGIGYFGYD
LEFTKEVKGK KFSILGFTGW YVNQERKNYL SSRIEQAKAN SDAVIVTFHW GNEYEYVPND
TQKELGRSAI ESGADMVWGH HPHVLQGIEQ YEERYIAYSL GNFCFGGNKN PSDKDSMIFQ
NEFKFKNGKI EEVDHNIIPI SISSKKERNN YQPTPVQNKE KERINERIKE LNKKID
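
The gene sequence begins with GTG rather than ATG, while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this on the first 30 bases of the gene. It is a simplified illustration, not the database's own method: the function name translate_cds is hypothetical, the first codon is assumed to be a valid initiator and is not validated, and the elongation codon assignments are those of the standard code, which table 11 shares.

```python
# Amino acids for all 64 codons, in the NCBI ordering with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace between 10-base blocks is ignored."""
    dna = "".join(dna.split()).upper()
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS must be non-empty and a multiple of 3 bases")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    # Assumption: the first codon is an initiator and is read as Met,
    # whatever its normal elongation meaning (GTG would otherwise be Val).
    protein = ["M"]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGATTGGCA AAATGAGTTT ATGGGTATTA"))  # prints "MIGKMSLWVL"
```

The output matches the first ten residues of the protein sequence listed above.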