Gene Daro_0858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_0858 
Symbol 
ID3569845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp930041 
End bp931141 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content59% 
IMG OID637679316 
Productchorismate synthase 
Protein accessionYP_284084 
Protein GI71906497 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value5.52804e-17 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000936669 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGTCCGGCA ATACATTTGG TACTTTGTTT ACTGTTACCT CTTTTGGTGA GTCGCATGGC 
CCGGCCATTG GCTGCGTGGT TGACGGTTGT CCGCCGGGGC TGGCCTTATG CGAGGCCGAT
ATACAGGCGG AACTGGATCG CCGCAAACCG GGTACTTCTC GTCATGTGAC GCAGCGCCGC
GAACCGGATA CTGTTGAGAT TCTCTCCGGT GTCTTCGAGG GGAAGACGAC CGGCACACCG
ATTGGCTTGT TGATTCGCAA CCAGGACCAG CGCAGCAAGG ATTACGGCAA CATTGCCGAT
ACTTTCCGTC CTGGCCATGC CGACTATGCC TACACCCAGA AATACGGATT TCGTGACTAT
CGTGGCGGTG GCCGTTCGTC AGCCCGCGAG ACGGCGGTGC GCGTGGCGGC CGGGGCGATT
GCCCGCAAGT GGCTGCACGA ACGCTTCGGG GTGGCGATTC GTGGCTGGAT GAGTGCGCTC
GGGCCAATCG AAATTCCGTT TGTTAGTGCT GATGCGATTG ATGGCAACGC CTTCTTTGCG
CCGAATTCGG CCATCGTGCC GGAGCTGGAG GCTTATATGG ATAAGCTGCG CAAGTCGCTG
GACTCTGTGG GCGCCAAGAT CACTGTAACC GCTACCGGTG TGCCTCCGGG TTGGGGTGAG
CCGGTCTATG ATCGGCTCGA TGCCGAGATC GCCTACGCGA TGATGGGGAT CAATGCCGTC
AAGGGGGTTG AAATCGGTGC CGGTTTCGAT TCGGTCGCCC AGAAAGGCAG CGAGCATGGC
GATGAAATGA CGCCACAGGG CTTTGCGACC AACCATGCCG GTGGTGTGCT CGGTGGTATT
TCGACAGGGC AGGAAATCGT GGTCAATATG GCGATCAAGC CGACCTCGTC AATTGCCCAG
TCGCGCCGCT CGATCAATCG CCAGGGGGAG GCTATTGAGG TGGCAACCGA GGGGCGGCAT
GACCCCTGTG TCGGCATTCG TGCCACGCCG ATTGCCGAAG CGATGCTGGC CTTGGTTCTG
ATGGATCATG CTTTGCGTCA TCGTGCCCAG TGTGGCGATG TGCTATGTGC GACGCCGCGC
ATTCCGGGGA AAATCGCGTA G
 
Protein sequence
MSGNTFGTLF TVTSFGESHG PAIGCVVDGC PPGLALCEAD IQAELDRRKP GTSRHVTQRR 
EPDTVEILSG VFEGKTTGTP IGLLIRNQDQ RSKDYGNIAD TFRPGHADYA YTQKYGFRDY
RGGGRSSARE TAVRVAAGAI ARKWLHERFG VAIRGWMSAL GPIEIPFVSA DAIDGNAFFA
PNSAIVPELE AYMDKLRKSL DSVGAKITVT ATGVPPGWGE PVYDRLDAEI AYAMMGINAV
KGVEIGAGFD SVAQKGSEHG DEMTPQGFAT NHAGGVLGGI STGQEIVVNM AIKPTSSIAQ
SRRSINRQGE AIEVATEGRH DPCVGIRATP IAEAMLALVL MDHALRHRAQ CGDVLCATPR
IPGKIA