Gene Athe_1696 details


Gene Information

Locus tag: Athe_1696
Symbol:
ID: 7409206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1782378
End bp: 1783748
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 37%
IMG OID: 643716067
Product: Chorismate binding-like protein
Protein accession: YP_002573563
Protein GI: 222529681
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID:
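
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and ends with one stop codon.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1782378, 1783748)
print(length, protein_length_aa(length))  # 1371 456
```

The result matches the listed Gene Length (1371 bp) and Protein Length (456 aa).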


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAAGG TGGCAAACGT GCGCCCCGAT AGCTCATCGG GGCTTTTTTG TTATCATGAT 
ATTAGAAATA GTATACCTTT CTTTGAAGAA TATTCAAACG ATTCAATTGA CCTATCAAAT
ATAGATTTTG ATTCAATCGA GGGGGATTTC TTTTTCTTTG AAAATTCCCA GATGGCAGCA
TTTTGCACTG ATAGGTTGTT ATGTATCACA GTTTACCATG ACAGGACCGA GATTGATTTT
GAAGGGACTA TAGATGTTTT CTATGGCGAT GTTTCTGCAA TGCTTGATAG CATATTGGAA
ATCTTAATTC AAAAGAATGC TTATATCACC TGTCAATTTA ACTATCATGC AGTAAATTTG
TTAGAAGACA TATATTCTGC TGACAGTCCC TATATAGTTC TAAATGTTTA CAGAAAAAAT
GTACTTGTTG ATAAACTTAC AGGCAAGAAA ACTCTTTTGG TGAGCAAGGA ATCTGAAAAA
GATGCAGAGG TAGATTTTAA GAGGTACCAG AGAAGTTTGT TTGATGTAAA AAGCAATGTG
GTCTTTTCAA CACCAAAAGA GTACTTTATC AGCACAGTCA AGCAGGCAAA AGAGGATATC
AGAAATGGTG AGATTTTTCA GATTGTTCTG TCTCAGATAA TATTGGTCAA AAGCAATATA
TCAACAAACC ATCTTTTTTA CACAATGAAA GAGAGAAATC CTTCAGAGTA CAGCATTGTG
ATAAACAATG AAGAAAGCCA AGTGATTTGT TTTTCGCCAG AGACTCTTAT AAAGAAAAAA
GGAAACACAG TAAAGACATT TCCAATTGCA GGAACGTACA GGATAAACGA AGGCGATGAT
GTTGCCCAGA AAAAGATTGA GATACTGAAA GACAAGAAAG AGATAAGTGA ACATGTCATG
CTTGTTGACC TTGCGCGAAA TGATCTTGGA AGGATTTCAA AACCCGGGAC TGTAAAAGTA
GAAGAGTACT TGAGAATAAA AAGGCTTTAT AATCTCATTC ATATATATTC AGTTGTTACA
GGTGAACTTG AAGAAAAGAG CCTCACAAAA ACGATACTAT CTGTTTTTCC GGCCGGGACG
CTGACCGGCG CACCCAAGAT AAGAGCTATG CAGCTAATTG AAAAGTACGA AAGGCAGAGA
AGAGATCTTT ACGGAGGAGC AATTGGATAT ATCTACAAAG ACCAGTTTGA CCTTGCCATA
GCTATAAGAA TGGCTGTGAA GGACAAAAAG GAAAGCATTA TCAAGCTTCA AAGTGGTGCG
GGAATTGTAA ATTTGTCAGT GCCTGAGAAT GAGTATCAGG AGTGTTTGAC CAAGCTCAGA
GCGTTTTTGA GGATAATGGA GGTGAATGAG GATGATATTG TTAATAGATA A
 
Protein sequence
MEKVANVRPD SSSGLFCYHD IRNSIPFFEE YSNDSIDLSN IDFDSIEGDF FFFENSQMAA 
FCTDRLLCIT VYHDRTEIDF EGTIDVFYGD VSAMLDSILE ILIQKNAYIT CQFNYHAVNL
LEDIYSADSP YIVLNVYRKN VLVDKLTGKK TLLVSKESEK DAEVDFKRYQ RSLFDVKSNV
VFSTPKEYFI STVKQAKEDI RNGEIFQIVL SQIILVKSNI STNHLFYTMK ERNPSEYSIV
INNEESQVIC FSPETLIKKK GNTVKTFPIA GTYRINEGDD VAQKKIEILK DKKEISEHVM
LVDLARNDLG RISKPGTVKV EEYLRIKRLY NLIHIYSVVT GELEEKSLTK TILSVFPAGT
LTGAPKIRAM QLIEKYERQR RDLYGGAIGY IYKDQFDLAI AIRMAVKDKK ESIIKLQSGA
GIVNLSVPEN EYQECLTKLR AFLRIMEVNE DDIVNR
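
The gene and protein sequences above are printed in blocks of 10 characters, and the record lists translation table 11 and a GC content value. The following is a minimal sketch of how the block-formatted gene sequence could be cleaned, translated, and checked for GC content. It assumes the sense-codon assignments of the NCBI standard code, which table 11 shares; it does not handle alternative start codons or ambiguous bases. All function names are illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G); table 11 uses the same
# sense-codon assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGAAAAGG TGGCAAACGT GCGCCCCGAT"))  # MEKVANVRPD
```

Passing the full gene sequence to `translate` and comparing the result with the listed protein sequence (after `clean`) is one way to confirm that the two agree; `gc_fraction` can be compared against the listed GC content in the same way.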