Gene Tneu_0780 details

Gene Information

Locus tag: Tneu_0780
Symbol:
ID: 6164324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 699421
End bp: 700518
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 62%
IMG OID: 641667938
Product: chorismate synthase
Protein accession: YP_001794165
Protein GI: 171185246
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
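
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `expected_lengths` is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon, neither of which is stated by the record itself.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the stop codon
    return gene_len, protein_len


# Values taken from the record above.
print(expected_lengths(699421, 700518))  # (1098, 365)
```

Under those assumptions the result agrees with the listed gene length of 1098 bp and protein length of 365 aa.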


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.151802
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.220812
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACACCT TCGGAAGAGA GCTCCGGATA ACCACCTTCG GCGAATCCCA CGGCAAGGCC 
ATAGGCGTGG TGATAGACGG CGTGCCGGCT GGGCTGGAAC TGACGGAGGA GGACATCAAG
AGGGAGCTGG AGAGGAGGAT GTTCTGCCAC ATACCAGTCC TCAACCCGAG GTGCGAGCCG
GAGGAGGTGG AGATACTATC CGGCGTGAAG GAGGGCTACA CCCAGGGCAC CCCCATAGCC
GTCGTGATAT GGAACAGACG CGTCATCTCC AGCTACTACG AGGAGCTCTG GATGAAGCCC
AGGCCGGGCC ACGCCGACTT CGCCTACTAC CTCAAATACG GCAGACACTA CGACCACAGG
GGGGGAGGCA GAGCCTCCGG TAGAACAACC GCGGCTGTGG TGGCGGCGGG GGCAGTCGCC
AAGAAGATGC TCGCCCTAGC CGGCGCCGAG GTGGCCGGCC ACATAGTCGA GCTAGGCGGC
GTCGAGATAA ACGCCAGCTA CACCTACGAA GACGTCAAAA AAAGCTGGGG GCGGCCCCTC
CCCGTGGTGG ATCAACAAGC CCTAGACAAA ATGCTGGAAA AGATCCGGGA GGCCGCCATG
AGGGGAGACA GCATAGGCGG GGGGGTGGAG GTCTGGGCCG TGGGGGTGCC GCCCGGCCTG
GGGGAGCCCC ACTTCGGCAA GATAAAAGCC GACATAGCCG CCGCCGCCTT CTCCATACCA
GGCGCCATAG CGCTCGACTG GGGCATGGGC AGAGCGCTGG CGAAGATGTG GGGAAGCGAG
GCCAACGACC CCATAACAGT CGCCAACGGC AGGCCAACCC TCGCCACCAA CAAAATCGGC
GGCGTCCTCG GCGGAATAAC CGTGGGAACC CCCATATACT TCAGAGCCTG GTTCAAGCCC
ACCCCCTCCG TCAGAAAGCC GCAGCAGACG GTGGACCTAG CCAAGATGGA GCCTACGACG
ATAGAGTTCA AGGGGAGATA CGACGTGTCC ATAGTCCCCA AAGCCCTCGT GGCGCTGGAG
GCCATCACGG CGGTAGCACT CGCCGACCAC CTACTCAGGG CAGGTCTCAT AAGAAGAGAT
AAGCCGCTGG GGAGATAG
 
Protein sequence
MNTFGRELRI TTFGESHGKA IGVVIDGVPA GLELTEEDIK RELERRMFCH IPVLNPRCEP 
EEVEILSGVK EGYTQGTPIA VVIWNRRVIS SYYEELWMKP RPGHADFAYY LKYGRHYDHR
GGGRASGRTT AAVVAAGAVA KKMLALAGAE VAGHIVELGG VEINASYTYE DVKKSWGRPL
PVVDQQALDK MLEKIREAAM RGDSIGGGVE VWAVGVPPGL GEPHFGKIKA DIAAAAFSIP
GAIALDWGMG RALAKMWGSE ANDPITVANG RPTLATNKIG GVLGGITVGT PIYFRAWFKP
TPSVRKPQQT VDLAKMEPTT IEFKGRYDVS IVPKALVALE AITAVALADH LLRAGLIRRD
KPLGR
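
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the database's own method: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, only the first line of the gene sequence is used as input, and the codon table is the standard amino acid assignment, which translation table 11 shares for elongation codons (the tables differ in permitted start codons).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at a stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above (60 bp).
FRAGMENT = "ATGAACACCT TCGGAAGAGA GCTCCGGATA ACCACCTTCG GCGAATCCCA CGGCAAGGCC"

print(translate(FRAGMENT))  # MNTFGRELRITTFGESHGKA
```

The printed residues match the first 20 residues of the protein sequence. Passing the complete gene sequence to `gc_fraction` gives a value that can be compared with the GC content listed in the record; the 60 bp fragment alone is not expected to reproduce it.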