Gene DET1481 details


Gene Information

Locus tag: DET1481
Symbol: trpE
ID: 3229289
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dehalococcoides ethenogenes 195
Kingdom: Bacteria
Replicon accession: NC_002936
Strand:
Start bp: 1339193
End bp: 1340650
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table: 11
GC content: 50%
IMG OID: 637121041
Product: anthranilate synthase component I
Protein accession: YP_182181
Protein GI: 57233826
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
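
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration that assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1339193, 1340650)
print(length_bp)                  # 1458
print(protein_length(length_bp))  # 485
```

Both results match the Gene Length and Protein Length values listed in the record.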


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATTACC CATCTTTAGC CGAAGTAAAA AAACTGGCCG CACAGGGCAA CCTGATACCC 
ATCTCCTGTG AGATTATGGC CGACCTTGAA ACCCCGGTTT CCGCTTTTCT GAAAATCAAA
GACAGCCAAA ATGCTTTTCT GCTGGAAAGT GTGGAAGGCG GGGAGCGCGT AGCCCGATAT
AGTTTCATCG GCACCAACCC CCATAAAGTG CTTACAGCCT ATCAGACAGA TACCGTTCCC
CCTCTAACCC AAGTTGAAAA TGAACTAAAC AAATACCGGG TAGTACCGGT GGGGGATTTG
CCCCGTTTCT GCGGCGGGGC GGTGGGTTTT CTGGGCTACG AGGCAGTTAC CCGCTTTGAG
GAGTTGCCCT CGCCATCGGC TGACCCCCTA AATCTTCCGG AAGCAGTCTT CATGCTGGTT
GATACCATGC TGGTTTTTGA CCACATCAGC CATTCCATAA AAGTACTAAG CTATGTCCAT
ACCGAACAGG ATATTGAAAC GTCATATAAT CAGGCTATCC GGAATATAGA AAATCTGGTT
AACCGCCTTA GAAAGCCGCT GCCCGAAACC GCCCCAAAAT CTACCGCCGC AAGTATCCCC
GAAATGAAAT CCAATTTCAA ACAGGCGGAT TTTGAGGGCA AGGTATCCAA AATAAGAGAT
TACCTTAACT CAGGCGAAGC TATTCAGGTA GTTTTGTCAC AGCGTCTGTC CAGACCCACC
TCCGCCCATC CCTTTGACAT CTACCGTGCC TTGCGTTCGG TAAACCCTTC ACCATACATG
TACTATCTGG ATTTCGGTGA TTTCCAGATT GTGGGCGCCT CGCCGGAGGT ACTGGTACGG
GTGGAAGACG GCGAGGTTAT GACCCGCCCT CTGGCAGGCA CCAGAAAACG GGGCAAAACC
CAGAAAGAAG ACACCAGTCT TGAGCAGGAA CTCCGCCATG ACGAAAAAGA GTGTGCCGAA
CATATCATGC TGGTGGATTT GGGACGAAAC GATATCGGGC GTATAAGCCA GCCGGGCACA
GTCCGCATAA CCGACGTCAT GGATGTGGAA CGCTATTCCC ACGTAATGCA TCTGGTTTCC
CACGTACAGG GCAAATTAAA ACCAAACATT ACTCCGTTTG AGGCTTTGCA ATCCTGCTTC
CCGGCAGGCA CAGTCTCAGG CGCACCTAAA ATACGAGCTA TGGAAATAAT AGCTGAAATG
GAAACCGAAA AGAGAGGCAT TTATGCCGGG GCAGTCGGAT ATTTTTCTTA TTCGGGCAAT
ATGGACATGG CTATAGCCAT ACGCACCATG GTTGTCAAGG GAGGCATTGC CCATATCCAG
GCAGGCTGCG GCATAGTAAG TGACAGCGTA CCCGAACATG AGTATCAGGA AACATTAAAC
AAAGCTCAGG CTTTGCTGAA AGCTCTGGAC AGGGCAGAAA ATCAGGCATC GGAGAAACCG
CATGTTATTA CTAATTGA
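
The GC content reported for this gene can be recomputed from the sequence block above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C among the A/C/G/T bases; `gc_content` is a hypothetical helper, and the database may round or compute the value differently.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace and other symbols are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Works directly on the space-separated 10-base blocks used in this record.
first_blocks = "ATGTATTACC CATCTTTAGC"
print(gc_content(first_blocks))
```

Passing the full gene sequence, with its spaces and line breaks left in place, gives the value for the whole gene.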
 
Protein sequence
MYYPSLAEVK KLAAQGNLIP ISCEIMADLE TPVSAFLKIK DSQNAFLLES VEGGERVARY 
SFIGTNPHKV LTAYQTDTVP PLTQVENELN KYRVVPVGDL PRFCGGAVGF LGYEAVTRFE
ELPSPSADPL NLPEAVFMLV DTMLVFDHIS HSIKVLSYVH TEQDIETSYN QAIRNIENLV
NRLRKPLPET APKSTAASIP EMKSNFKQAD FEGKVSKIRD YLNSGEAIQV VLSQRLSRPT
SAHPFDIYRA LRSVNPSPYM YYLDFGDFQI VGASPEVLVR VEDGEVMTRP LAGTRKRGKT
QKEDTSLEQE LRHDEKECAE HIMLVDLGRN DIGRISQPGT VRITDVMDVE RYSHVMHLVS
HVQGKLKPNI TPFEALQSCF PAGTVSGAPK IRAMEIIAEM ETEKRGIYAG AVGYFSYSGN
MDMAIAIRTM VVKGGIAHIQ AGCGIVSDSV PEHEYQETLN KAQALLKALD RAENQASEKP
HVITN
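
The protein sequence above can be derived from the gene sequence by codon translation. The sketch below is self-contained and uses the standard codon assignments, which translation table 11 shares (table 11 differs from the standard code in which codons may act as start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative, not taken from the source database.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first, so the blocked layout of this record
    can be passed in directly. Codons with non-ACGT symbols raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTATTACC CATCTTTAGC CGAAGTAAAA"))  # MYYPSLAEVK
print(translate("CATGTTATTA CTAATTGA"))               # HVITN
```

The two calls translate the first 30 bases and the last 18 bases of the gene sequence, and reproduce the first ten and last five residues of the protein sequence.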