Gene Cphy_3093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3093 
Symbol 
ID5743179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3782954 
End bp3784057 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content39% 
IMG OID641294193 
Productchorismate synthase 
Protein accessionYP_001560188 
Protein GI160881220 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000101466 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGGTT CAAGTTTCGG ATCAATTTTT AAAATAGCAA CTTGGGGAGA ATCCCATGGA 
AAAGGTATCG GCGTTGTTGT TGACGGGTGT CCTGCAGGTC TTACTCTAAA TGAAGAAATG
ATTCAGACAT TTCTAAACCG TCGAAAACCT GGGCAAACGA AATATTCGAC TCCAAGAAAA
GAAGATGATC TTGTAACAAT CCTATCTGGT GTTTTTGAAG GAAAAACTAC AGGTACCCCA
ATTTCCATGA TGATTGCAAA TGAGACTGCA CGTTCTGCAG ATTATAGTGA AATAGCAAGC
TTTTATAGAC CTGGTCATGC AGACTATACT TTTGATGCAA AATACGGTTT TCGTGACTAT
CGCGGGGGTG GACGTTCCTC AGGACGTGAA ACAATTGGAC GTGTAGCAGC AGGTGCAATC
GCTGCTGCCC TCTTAAAAGA ACTAGGAATT GAAGTTTTTA CTTATACCAA ATCCATTGGT
CCTATTCAAA TTGATTATCA TAAGTGCCAA AAAGAAAACT TAACTTTAAG TCCTCTTTGC
ATGCCAGATT TAGAAGCATC TCAGAAAGCG GAAGATTATC TAGAGCAGTG CATTCACAAT
TTAGACTCTA GTGGTGGTAT GATTGAATGC ATTATATCTG GAGTTCCAGC AGGAATTGGG
GAACCAGTAT TTGATAAATT AGATGCGCAG CTTGCAAAGG CGATATTCTC TATTGGCGCT
GTAAAGGGCT TTGAGATTGG ATCTGGTTTT GAAGTAGCAA AACAGTTAGG TTCCGAAAAT
AATGATGGGT TTGCATTCGA TGCAAATGGA AAACTCATTA AGTTAACCAA TCATTCTGGC
GGTATCCTTG GAGGAATTAG TGATGGCTCC GAAATTATCT TCCGGGCTGC AATTAAACCA
ACTCCTTCTA TAAAAAAAGA ACAGCAAACC GTTAATAAAT CAGGTGAGAA CATAAATGTA
TCTATAAAAG GCCGTCATGA TCCAATTATA GTCCCAAGGG CAGTTGTTGT TGTGGAAGCG
ATGGCAGCCT TAACTCTAGC AGATTTGTTA CTGAGTGGTA TGTCCTCAAA AATGGATTAC
GTAAAGAAAA TCTATCAAAA ATAA
 
Protein sequence
MSGSSFGSIF KIATWGESHG KGIGVVVDGC PAGLTLNEEM IQTFLNRRKP GQTKYSTPRK 
EDDLVTILSG VFEGKTTGTP ISMMIANETA RSADYSEIAS FYRPGHADYT FDAKYGFRDY
RGGGRSSGRE TIGRVAAGAI AAALLKELGI EVFTYTKSIG PIQIDYHKCQ KENLTLSPLC
MPDLEASQKA EDYLEQCIHN LDSSGGMIEC IISGVPAGIG EPVFDKLDAQ LAKAIFSIGA
VKGFEIGSGF EVAKQLGSEN NDGFAFDANG KLIKLTNHSG GILGGISDGS EIIFRAAIKP
TPSIKKEQQT VNKSGENINV SIKGRHDPII VPRAVVVVEA MAALTLADLL LSGMSSKMDY
VKKIYQK