Gene Cphy_3848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3848 
Symbol 
ID5744800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4711613 
End bp4713079 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content40% 
IMG OID641294960 
Productanthranilate synthase component I 
Protein accessionYP_001560934 
Protein GI160881966 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00564] anthranilate synthase component I, non-proteobacterial lineages 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATTAC CAAGGTATGA AGATTTGGCA GAGCTTTCAT TTAAATTCCC TTATGTTCCG 
GTATATAAAG AAATTTATTC AGATCAGACC ACACCAATCT TAGTGATGCA GAAACTATCG
TTACACGCTA AAAACTATTA TCTATTTGAA AGTGCAGAGG GAAATGAGCG ATGGGGTCGC
TATTCCTTTC TCGGATTTAC CCCAGTACTA AAGCTGTTTG GCAAGGGTGG CAAGGTTTTT
CTTAAAAAAG GTTTAGAGGA AGAAGCCATA GAGCAAACAG GAGACAGCAT GCAGGCGATT
CGAAGTCTCC TAAAGGAGTA TAGAGCACCA AAACTTGAGA AGCTTCCATC GTTTTCCGGA
GGGCTTGTGG GGTACTTTGG CTATGAAATG ATAGGAAGAA TGGAACCTAA ATTACACCTT
AGAGAGAGTG ATTTTGAGGA GTTTTCCTTA CAGCTTTATC TAGAGGTTAT AGCATTTGAT
CATGTTAAAC AAAAGATGTA TCTGATTGAT CATTATCCAA CCAAGGAAGG AAGAAAAGGG
TATGATGAGG CAGTTCTTCG TATCGAAGCC CTTGAAACCT TGTTAGTTGA AACGATACCG
CCGGCCTTTC AGTTTAAGGA GGAAGCGCCT GTTTTTAAGA GTAATATAAC GAAGAAAGAG
TACCTTGCTA TCATAGAGAA AACAAAGCAC TACATTAGGG AAGGTGATAT CTTTCAAGGA
GTTATCTCAA GAAGGCTGGA GGCCACTTAT AAGAATAGCC TAATGAATGC ATATCGAGTA
TTAAGAACGG CGAATCCTTC TCCATATATG TACTTTATTC ACTCTGGTGA TATTGAAATT
GCTGGTTCAT CGCCAGAAAC CTTGGTAAAA GTCATCGATA GAGAAGTAAC TATCTTCCCA
ATTGCAGGGA CTAGGCCCAG GGGGAGCACG GGTGAAGAAG ACGAAAAATT GGAAAAAGAA
CTACTTGAAG ATGAAAAAGA ACTCGCGGAG CACAATATGT TAGTTGATTT GGCTAGAAAT
GATGTGGGAA GAGTGGCAGC TTATCAATCG GTTGTAGTTG AAGAATATCT AAAGGTGCAT
CGATACTCTA AGGTTATGCA CATTACTTCA AAGGTTAGTG GAAAGTTAAG AGAAGATAAG
GATGGCTGTG ATGCACTAAT TGCATCCTTT CCAGCTGGAA CTTTGACTGG AGCACCAAAG
ATACGTGCTT GTGAAATCAT AGAAGAGTTA GAAGAAAGTC CTAGAGGAAT CTATGGAGGT
GCCATAGGGT ATTTTGACCT TTCTGGGAAT CTGGATTTTT GTATTGCAAT ACGAACAGCG
GTTAAGAAGA AAGATAGCGT ATATGTTCAG GTTGGAGCTG GCATTGTGGC GGATAGTAAT
AGTGAACTTG AGTATGAAGA AACAAATCAT AAGGCAGCGG CAGTTGTTGA TGCATTACTT
AGAGCAGGGG AGGTAGATAG GGTATGA
 
Protein sequence
MVLPRYEDLA ELSFKFPYVP VYKEIYSDQT TPILVMQKLS LHAKNYYLFE SAEGNERWGR 
YSFLGFTPVL KLFGKGGKVF LKKGLEEEAI EQTGDSMQAI RSLLKEYRAP KLEKLPSFSG
GLVGYFGYEM IGRMEPKLHL RESDFEEFSL QLYLEVIAFD HVKQKMYLID HYPTKEGRKG
YDEAVLRIEA LETLLVETIP PAFQFKEEAP VFKSNITKKE YLAIIEKTKH YIREGDIFQG
VISRRLEATY KNSLMNAYRV LRTANPSPYM YFIHSGDIEI AGSSPETLVK VIDREVTIFP
IAGTRPRGST GEEDEKLEKE LLEDEKELAE HNMLVDLARN DVGRVAAYQS VVVEEYLKVH
RYSKVMHITS KVSGKLREDK DGCDALIASF PAGTLTGAPK IRACEIIEEL EESPRGIYGG
AIGYFDLSGN LDFCIAIRTA VKKKDSVYVQ VGAGIVADSN SELEYEETNH KAAAVVDALL
RAGEVDRV