Gene Jann_0331 details

Gene Information

Locus tag: Jann_0331
Symbol:
ID: 3932772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 336913
End bp: 338025
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 62%
IMG OID: 637902676
Product: chorismate synthase
Protein accession: YP_508273
Protein GI: 89052822
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
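
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(336913, 338025)
print(length_bp)                  # 1113
print(protein_length(length_bp))  # 370
```

Both results match the Gene Length and Protein Length fields in the record.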


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.288386
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTATGA ATTCCTACGG TCACCTGTTC CGCGTCACCA CCTGGGGCGA AAGCCATGGA 
CCTGCCCTGG GCGCGACAGT TGATGGCTGC CCTCCGGGGA TCGACGTCGA TGCGGCCGCG
ATCCAGCACT GGCTCGACCG TCGCAAACCT GGCCAGAACA AATACACCAC CCAGCGGCGA
GAGGCTGATG AGGTGGAGAT TTTGTCGGGC GTCTATGAAG GACAGTCCAC CGGCACCCCC
ATCCAGCTCA TGATCCGCAA CACCGATCAG CGATCCAAGG ATTACGGCGA TATTGCCGAG
AAGTTCCGGC CCGGCCATGC GGACATTACC TATTGGCAGA AATACGGTAT CCGTGATCCG
CGTGGCGGCG GACGGTCAAG CGCGCGTGAG ACGGCAGCAC GGGTCGCCGC GGGTGGCGTG
GCGCGTCTGG CGCTTGCGGC GCTGGTGCCT GCGGTGAAGA TCACAGGTTA CATGGTGCAA
ATGGGGCCGC ACGGGATTGA TCGCGAGTGC TTCGATCTGG CGCAGGTGGA CGAAAACCCA
TTCTGGGTCC CCGATGCCAA GGCCGCTGAT GAGTGGGCCG CCTACTTGGA TGGTTTGCGC
AAGTCCGGCG ACAGCGTCGG TGCCGTGATT GAGGTTCGCG CCAGCGGGCT GCCCGCAGGT
CTTGGGGCGC CGATCTATGG CAAGCTGGAT ACCGATCTGG CCGCCGCGAT GATGAGCATC
AATGCCGTCA AAGGCGTGGA GATCGGCGAC GGCATGGCGG CCGCGGCGCT GACCGGCTCG
GCCAATGCGG ATGAGATCCA TATGGGCGAT AATGGCCCTG AATATTCCTC AAACCACGCG
GGCGGCATCC TTGGCGGTAT CTCCACCGGG CAGGACGTCA TCGTCCGGTT TGCGGTCAAA
CCGACATCCT CCATCCTCAC GCCGCGCGCG ACGATCACCA AGGCGGGCAC CCCGGCCGAG
ATCATCACCA AAGGCCGCCA CGATCCCTGT GTGGGAATCA GGGCTGTGCC GGTTGGCGAG
GCGATGATGG CCTGTGTCGT GCTAGACCAC ATTTTGCTGC AAAGAGGGCA AATTGGTGGC
AAAGTCGGGG AAACCCGGGG AAAAATCGGA TAA
 
Protein sequence
MSMNSYGHLF RVTTWGESHG PALGATVDGC PPGIDVDAAA IQHWLDRRKP GQNKYTTQRR 
EADEVEILSG VYEGQSTGTP IQLMIRNTDQ RSKDYGDIAE KFRPGHADIT YWQKYGIRDP
RGGGRSSARE TAARVAAGGV ARLALAALVP AVKITGYMVQ MGPHGIDREC FDLAQVDENP
FWVPDAKAAD EWAAYLDGLR KSGDSVGAVI EVRASGLPAG LGAPIYGKLD TDLAAAMMSI
NAVKGVEIGD GMAAAALTGS ANADEIHMGD NGPEYSSNHA GGILGGISTG QDVIVRFAVK
PTSSILTPRA TITKAGTPAE IITKGRHDPC VGIRAVPVGE AMMACVVLDH ILLQRGQIGG
KVGETRGKIG
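
The relationship between the gene and protein sequences can be spot-checked by translating the first and last codons of the gene. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene begins with ATG). The `translate` helper is a hypothetical name introduced for illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the
    record can be pasted directly.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 33 bases of the gene sequence in the record.
print(translate("ATGAGTATGA ATTCCTACGG TCACCTGTTC"))      # MSMNSYGHLF
print(translate("AAAGTCGGGG AAACCCGGGG AAAAATCGGA TAA"))  # KVGETRGKIG
```

The two outputs correspond to the first and last ten residues of the protein sequence listed above.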