Gene Athe_2217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2217 
Symbol 
ID7408414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2347219 
End bp2348538 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content33% 
IMG OID643716585 
ProductAnthranilate synthase 
Protein accessionYP_002574064 
Protein GI222530182 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCAGATTT TAGTTTATAA CATAGTACCT GACCCATTTT TGATTTTTTG TCATTTGAAA 
AGTGATTTTT CAGTTTTGCT TGAGAGCAGT ATGCTGAGCA AGAGGTATGG AAGATATTCT
TTTTTGTTTT TAAAGCCCAA AGAAGTTTTT ATCTGCAATG AAGATGATGA TGTTTTTGAA
TATTTAAAGC AACTTTCAGA CAAAGTAGAA AAAAATAAAG ACACTTCTGA TTTTGTTTTT
AATGGTGGGT TTGCAGGATA TTTTTCTTAC AACTTTGGTG TTGACCTTTT TGAAATTGAA
AGGAAAAAAG ATACTTCTCC TATTCCCAAA GCGTATTTTG GATATTTTGA AGATTTTGTT
GTTATTGACC ACTTTGAAAA AAAGACATAT GCTTCTTTTA CATCTAAAGC TTTAGCACAA
GAATTCGAAA TGATCCTAAT AAGTGAAAAC CTTGCTCTGC CAAGCTTTAA AAGAACTTGC
ATTGAAAGAG CTTGGTGCAA CTTTGAAAAG TCAGAATACA TGCAAGCGGT GAAAAGAATA
AAGAATTATA TTTTTGAAGG TGATGTTTAC CAAGTAAATC TTTCACAGCG CTTTTTTGTA
AAAGGAGTAT TTGACTCTGA CTTTTTGTAT TTTAACCTCA GAAAAAGAAA CTATGGCTGT
TACCATGCAT ATATAAAGCT TCCAAAAGCA TCTGTAATAT CAACATCGCC CGAACTTTTT
TTAAGAAAAA GAGGGGATAT CCTCATAACC AAACCAATAA AAGGTACATC TAAACGTGGA
AAAACCCCAG AAGAAGATAG AATTTTAAAA AAGCAGTTAT ATAACGATAT AAAGTGCAGG
TCAGAGCTTT TGATGATTGT TGACCTTGAG AGAAACGATT TTGCAAAGAT ATGTTTGCCA
GAGTCCATTG AGGTTGAAAA ACTTTTTGAT GTTGAAGAAT ACTCAACAGT TTTTCATCTT
GTTTCAACTA TAAAGGGAAA GCTTCTAAAT GGTATGGATT TAAAAAGGAT AATTGAAGCA
ACCTTTCCTG GCGGGTCAAT AACAGGTGCA CCAAAACTGA ATGCTATAAA GATTATAGAG
GAGCTTGAAA AGTGTCCGCG GGGGATATAC TGTGGTTCTA TTGGTTATAT CTCAAACAAT
TTCAACATGG ATTTTAACAT TGCAATCAGA ACGCTTGTTA TAGAAGAGGA CACAGTATAC
TTTAGTGTTG GGGGCGGAAT TGTTTGGGAC TCTCAAGAAG AGGAAGAGTG GTGGGAGACA
ATCCACAAAG GAAGACCATT TATAGAAATT TTAGGAATTA ATTATTTTAC ATTTATGTAA
 
Protein sequence
MQILVYNIVP DPFLIFCHLK SDFSVLLESS MLSKRYGRYS FLFLKPKEVF ICNEDDDVFE 
YLKQLSDKVE KNKDTSDFVF NGGFAGYFSY NFGVDLFEIE RKKDTSPIPK AYFGYFEDFV
VIDHFEKKTY ASFTSKALAQ EFEMILISEN LALPSFKRTC IERAWCNFEK SEYMQAVKRI
KNYIFEGDVY QVNLSQRFFV KGVFDSDFLY FNLRKRNYGC YHAYIKLPKA SVISTSPELF
LRKRGDILIT KPIKGTSKRG KTPEEDRILK KQLYNDIKCR SELLMIVDLE RNDFAKICLP
ESIEVEKLFD VEEYSTVFHL VSTIKGKLLN GMDLKRIIEA TFPGGSITGA PKLNAIKIIE
ELEKCPRGIY CGSIGYISNN FNMDFNIAIR TLVIEEDTVY FSVGGGIVWD SQEEEEWWET
IHKGRPFIEI LGINYFTFM