Gene Tneu_1178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1178 
Symbol 
ID6165099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp1068466 
End bp1069734 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content65% 
IMG OID641668328 
Productanthranilate synthase 
Protein accessionYP_001794553 
Protein GI171185634 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.265376 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCC CGCTGTCTAA GCTCCCGCCT CCGAGGGATC TGGCCCACGG GCTGTACCAG 
TCGGGGGAGG AGTTCGTGGC TCTTCTGGAG TCGGGCCAGG GATTCGCGGA GAGGGCGAGG
TTCACCCTCG TGGCGTGGGG GGTGGAGAGG GCGTACGTCT CCTCTGGGCC CGACCTCCAG
CAGGTGCTCT ACTCGGCGCA AAGGGAGCTG AGGGCGGACG GGGGGCCCTT CGGCGGCGAC
GTGTTAATCG GCGCCTTGAC CTACGAGGCG TCCTACTACG TGGAGCCTCT GTTGCTTAGG
TACAACAAGG TGGACCGGTC TATCCCGGCG GCGTTTCTGG TTAAGCCCAG GGGGTACATC
CTGTACGACA AGATGCTGGG GAGGGGCTAC CTGAGGGGCG AGATGCCGAG GGTCTCCGTG
GGGCGGGGGG AGGCCAGGGT GAGGGGGCCG GTGGCCATGA CCGACCCGGG CCGCTTCAAG
AGCTGGGTGG CGGAGGGGAG GGAGAGGATC GCGGCTGGGG AGATCCTCCA GGTGGTGCTC
TCCAGGTGGG TGGACTACAG GGCGGAGGGG GACCTCTTCC CTCTGTACAA GGCGCTGGCG
GAGGGGAACC CCTCGCCGTA TATGTACTTT GTAAAATACG GCGATATCCA CTTGATTGGG
ACGTCGCCTG AGCTGTTGGT GAAGGTGCAG GGCGGCCGCG TGGAGACCCA CCCCATCGCC
GGGACTAGGC CGAGGGGCGC CACCGAGGAG GAGGACCTGG CGCTGGAGGA GGACATGCTC
AGCGACGAGA AGGAGCTGGC TGAACACATC ATGTTGGTGG ATCTGGCTAG GAACGACATC
GGGAGGGTGT GCCAGCTCGG GTCTGTCAAG GTGGAGGAGC TGTTCGCCGT GGAGAAATAC
AGCAGGGTGC AGCACATAGT GTCTAGGGTC ATGGGCGTTA TGGACAGGCG GTTCACCCCC
GTCGACGCCC TCTTGGCCAC CCACCCGGCG GGCACCGTGT CGGGCGCCCC CAAGGTGAGG
GCTATGGAGA TAATCGCCGA GCTTGAGGAC GAGCCTCGGA GGTTCTACGC GGGAGCCGTG
GGCTTCATGT CGCCTTCTCT CCTGGAGTTC GCCATAGTCA TAAGGACCAT GGTGGCCGTG
GGCGACTCCC TCCGTATACA GGCGGGGGCG GGGGTTGTGT ACGACTCCAC GCCGGAGCGG
GAGTTTAGAG AGACCGAGTC TAAGCTGGCT GCGCTTAAAG CGGTCGTGGA GGGTGGGCCA
TGGACCTAA
 
Protein sequence
MKIPLSKLPP PRDLAHGLYQ SGEEFVALLE SGQGFAERAR FTLVAWGVER AYVSSGPDLQ 
QVLYSAQREL RADGGPFGGD VLIGALTYEA SYYVEPLLLR YNKVDRSIPA AFLVKPRGYI
LYDKMLGRGY LRGEMPRVSV GRGEARVRGP VAMTDPGRFK SWVAEGRERI AAGEILQVVL
SRWVDYRAEG DLFPLYKALA EGNPSPYMYF VKYGDIHLIG TSPELLVKVQ GGRVETHPIA
GTRPRGATEE EDLALEEDML SDEKELAEHI MLVDLARNDI GRVCQLGSVK VEELFAVEKY
SRVQHIVSRV MGVMDRRFTP VDALLATHPA GTVSGAPKVR AMEIIAELED EPRRFYAGAV
GFMSPSLLEF AIVIRTMVAV GDSLRIQAGA GVVYDSTPER EFRETESKLA ALKAVVEGGP
WT