Gene TM1040_2819 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2819 
Symbol 
ID4076638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2983051 
End bp2984163 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content59% 
IMG OID638008145 
Productchorismate synthase 
Protein accessionYP_614813 
Protein GI99082659 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0159117 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.634482 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATGA ATTCTTTTGG CCACCTTTTT CGAGTCACCA CCTGGGGTGA AAGCCATGGG 
CCCGCTTTGG GCGCTACGGT TGATGGCTGC CCGCCAAATG TCGCAGTCTC GGAAGAGATG
CTTCAACACT GGCTCGACAA GCGCCGCCCC GGTCAGAACA AGAACACCAC CCAGCGCAAT
GAGCCTGACG CGGTGAGAAT CCTCTCTGGT GTGTTTGAGG GAAAATCGAC CGGCACGCCG
ATTCAATTGA TGATTGAAAA CACCGACCAG CGCTCACGGG ATTATGGCGA GATCGCCCAG
ACATTCCGCC CAGGACATGC GGACATTACT TACTTTCAGA AATACGGCAA CCGCGACTAT
CGCGGTGGTG GACGATCCTC TGCACGCGAA ACCGCAGCGC GGGTGGCAGC AGGTGGCGTA
GCGCGCGAAG CCCTCAAGTC CTTGGCCCCG GGGATCGAGA TCAAGGGCTA TATGACCCGC
ATGGGGGAAA TGGAAATCGA CCGCGCGCGG TTTGACTGGT CTGCCATCGA TCAGAACGAC
TTCTGGATTC CCGATGCCGC CGCTGTTCAG GACTGGGAAG ACTATCTGCA GGCCCTTCGC
AAGCAGCACG ACTCGGTTGG CGCTGTTGTC GAAGTGGTCG CGCGCGGCGT ACCCGCAGGC
ATCGGAGCGC CAATCTACGG CAAGCTCGAC ACCGATCTCG CAGCGGCAAT GATGTCGATC
AACGCTGTCA AAGCCGTCGA AATCGGCGAA GGCATGAATG CTGCGTTGTT GAAGGGCTCC
GAGAACGCGG ATGAGATCTT CATGGGCAAT GACGGTGCGC CTGTCTACTC GTCGAATCAC
GCGGGTGGTA TCTTGGGAGG AATCTCAACG GGCCAAGATG TGGTGATCCG TTTCGCCGTG
AAGCCGACCT CGTCCATTTT GACCCCGCGC CAGTCGATTC GCAAAGACGG GACGGCTGCC
GAAGTCATCA CCAAGGGCCG CCATGATCCC TGTGTCGGTA TCCGCGCAGT GCCCGTGGCA
GAGGCCATGA TGGCCTTTGT GATTCTCGAT CACATCCTGT TGCACCGCGG GCAGATTGGC
GAGAATCAGG GCGTGATCGG CGCGCCCGAC TGA
 
Protein sequence
MSMNSFGHLF RVTTWGESHG PALGATVDGC PPNVAVSEEM LQHWLDKRRP GQNKNTTQRN 
EPDAVRILSG VFEGKSTGTP IQLMIENTDQ RSRDYGEIAQ TFRPGHADIT YFQKYGNRDY
RGGGRSSARE TAARVAAGGV AREALKSLAP GIEIKGYMTR MGEMEIDRAR FDWSAIDQND
FWIPDAAAVQ DWEDYLQALR KQHDSVGAVV EVVARGVPAG IGAPIYGKLD TDLAAAMMSI
NAVKAVEIGE GMNAALLKGS ENADEIFMGN DGAPVYSSNH AGGILGGIST GQDVVIRFAV
KPTSSILTPR QSIRKDGTAA EVITKGRHDP CVGIRAVPVA EAMMAFVILD HILLHRGQIG
ENQGVIGAPD