Gene Tpet_0783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpet_0783 
Symbol 
ID5171769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermotoga petrophila RKU-1 
KingdomBacteria 
Replicon accessionNC_009486 
Strand
Start bp795149 
End bp796522 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content48% 
IMG OID640563302 
Productanthranilate synthase component I 
Protein accessionYP_001244378 
Protein GI148269918 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0095875 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGTGC TGAAGCTCGT TTCGGATCTG GAAACACCGG TTTCCACTTT TATGAAGGTT 
TCCAGAGGAG AAGAATTCGC TTTCCTGCTT GAAAGCGTGG AGCTCGGTTC CGCTTTCGGA
AGGCATTCCT TCATAGGTAT AGGAAAAAAA GACGTGCTCG TTTTCGAAAA GGGAATCTTG
AGAACTTCGA ATCAGCAGCT CGATTATACT TCCTCTCCTC TCAAAGCGAT AAAAGACTGG
CTTGAGGTCT ATAGATACAA CGTCAAACAC GACGAACTCC CTTCGTTCAG GGGAGGAGCA
GTTGGCTTCG TCAGTTATGA CTACATTTCT TACATAGAGA AAGTCAAAGT CAGAGCCTCT
GTGTTTCCCA CGTTTTATTT CGTGGTGCCT GAGCATCTCA TAATATTCGA TCACCTGAAG
AACAACGTCT TCATCATCAG CGATAGTCCT GAGGAACTTG CATCGAAGGT CTTGAGCCCT
TTTGAAGAGG AACCAGAGAA GAACGTTTTT GTGACAGAAC CGGAATCGAA CTTCGAAAGA
GAGCAGTTCT ACAAAGTGGT TGAAAAGGCC AAAAAGTACA TTGTAGAAGG TGACATATTC
CAGGTGGTTC TCTCTCAGGC TTTCACATTC AAAACGACGC TGGATCCGTT CTACATCTAC
AGGGCGCTCA GAATGATCAA TCCTTCTCCC TACATGTTTT ATCTGAAGTT CGGAGACACG
GTCGTTTTGG GATCTTCTCC CGAAACCATG GCGAAGGTGG AAGGAGACAA GGCCACTGTG
AAGCCAATTG CCGGCACCAG ACCGCGGGGA AGAACTGTTG AAGAAGATCT GAAGCTGGAG
AGAGAACTTT TGAACGACGA AAAAGAGATA GCAGAGCACG TTATGCTTGT GGATCTTGGA
AGAAACGACC TTGGAAGAGT CTGTAAAGAA GGAACCGTGC GAGTGGAGAA GAAGATGGTT
ATCGAAAGGT ACTCCCATGT TATGCACATA GTCTCGCAGG TGAGCGGTGA ATTGAAGGAC
GATAAAGATG CTGTGGATGT GTTCGAAGCG ACTTTCCCTG CGGGGACAGT CTCTGGCGCA
CCCAAAGTGA GGGCAATGGA GATCATAGAA GAACTCGAAC CCACCCCACG AGGGCCATAC
GCTGGAGCCG TCGGGTATTT CTCTTTCCCC GATGATAAAG GAAAGATGAA CATGGATTCT
GCTATCACCA TCAGGTCCTT TTTCTTCAAA GGTAAACAGG GATGGCTCCA GGCAGGAGCG
GGTATCGTTT ACGATTCCGT TCCGGAGCGT GAGTATCAGG AGACTCTGAA CAAACTCAGG
GCGTTGTTCA GAAGCCTGGA AGTTGCGCAG AAGATCCAGG GGGGATTGTT TTGA
 
Protein sequence
MHVLKLVSDL ETPVSTFMKV SRGEEFAFLL ESVELGSAFG RHSFIGIGKK DVLVFEKGIL 
RTSNQQLDYT SSPLKAIKDW LEVYRYNVKH DELPSFRGGA VGFVSYDYIS YIEKVKVRAS
VFPTFYFVVP EHLIIFDHLK NNVFIISDSP EELASKVLSP FEEEPEKNVF VTEPESNFER
EQFYKVVEKA KKYIVEGDIF QVVLSQAFTF KTTLDPFYIY RALRMINPSP YMFYLKFGDT
VVLGSSPETM AKVEGDKATV KPIAGTRPRG RTVEEDLKLE RELLNDEKEI AEHVMLVDLG
RNDLGRVCKE GTVRVEKKMV IERYSHVMHI VSQVSGELKD DKDAVDVFEA TFPAGTVSGA
PKVRAMEIIE ELEPTPRGPY AGAVGYFSFP DDKGKMNMDS AITIRSFFFK GKQGWLQAGA
GIVYDSVPER EYQETLNKLR ALFRSLEVAQ KIQGGLF