Gene Nther_1776 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_1776 
Symbol 
ID6314297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp1843472 
End bp1844539 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content37% 
IMG OID642644150 
Productaminodeoxychorismate lyase 
Protein accessionYP_001917936 
Protein GI188586391 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.81136 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.183273 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCAGT TTTCTTTAAA AAATTATTTT AGCCAGCGAA AAAAAATGAT GATATACTTG 
ATAACAGGAT TTATGGCTGC TATTTTATTA GGCTCAGCAG GGATATATTT TTATATTTAC
AGTGGCTTAC AATCAGTAGA TATTGACGAA GAAATTGAAA TTGAAATTCC CAGGGGAAGT
AATTTAAGAC AGGTTGCCGA TATTTTAGAA GATAATGGCA TCATTAGAGA TGCAACTTTG
TTTAGATATT ATGCCAGGTT TTCTGGTTAC GATGCTCAAT TACAAGCGGG TGAATATTTA
TTCGAGGATG AGATTGCTCC CGAGGAGGTA TTGATTAAGC TTGCCCAGGG CGATGTTATC
GATCGGAGTA TTAGATTTAC AATCCCAGAA GGCCTTAGAG CTGATCAGGT AGCTCAAAGG
CTTGAAAGTC AAGGATTAGG TGATAAAGAT AAATTTCTTG AATTATTCTC AGAACCTGAA
GAGTGGGATT ATTGGTTCCT AGAGGGGTTA GCAGAAGAGC ATGTTAAGTT TCCATTAGAA
GGTTTTCTAT ATCCCGATAC TTATCAAGTC CAGGAGGATA TTAGTGAAGA AGAAGTAGTG
AAAAGAATGT TAGATCAATT TAATGAAGTA TTTGACGAAA GTTATCAAGA AAAGAAAGAA
CACCAAGGAT TTAATATCCA TGAATTGATA ACTATTGCTT CGATTGTAGA AAGAGAGGCA
GTAATTGACG ATGAGCGTGG CAAGGTAGCA GGGGTTTTCC TAAATAGATT GGAGAACAAT
ATGCGCTTAG AAGCCTGCGC AACAGTGGAA TATGTTCTTC AGGAAAACAA ACCCGTACTT
TCTGATGCAG ACACTCAAAT TGAAACACCA TACAATACAT ACCAAAATTC CGGATTACCA
CCTGGACCAA TTGCATCACC AGGTCGTGCT TCTATTGAAG CAGCTCTAGA CCCCAAGGAA
CACGATTATT TGTTTTTTGT TGCAAAACAT GATGGAAGTA GAACTCATGT GTTTTCAGAA
ACGTATCAAG AACATTTGCA GGCTAAAGAA AGAGTTAGAG CTGATTAA
 
Protein sequence
MAQFSLKNYF SQRKKMMIYL ITGFMAAILL GSAGIYFYIY SGLQSVDIDE EIEIEIPRGS 
NLRQVADILE DNGIIRDATL FRYYARFSGY DAQLQAGEYL FEDEIAPEEV LIKLAQGDVI
DRSIRFTIPE GLRADQVAQR LESQGLGDKD KFLELFSEPE EWDYWFLEGL AEEHVKFPLE
GFLYPDTYQV QEDISEEEVV KRMLDQFNEV FDESYQEKKE HQGFNIHELI TIASIVEREA
VIDDERGKVA GVFLNRLENN MRLEACATVE YVLQENKPVL SDADTQIETP YNTYQNSGLP
PGPIASPGRA SIEAALDPKE HDYLFFVAKH DGSRTHVFSE TYQEHLQAKE RVRAD