Gene Rcas_3830 details


Gene Information

Locus tag: Rcas_3830
Symbol:
ID: 5541333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5004407
End bp: 5005483
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 61%
IMG OID: 640895940
Product: chorismate synthase
Protein accession: YP_001433886
Protein GI: 156743757
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
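
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(5004407, 5005483)
    print(length)                  # 1077
    print(protein_length(length))  # 358
```

Under these assumptions the computed values agree with the Gene Length (1077 bp) and Protein Length (358 aa) fields of the record.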


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.929225
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGGAA ATACCTTTGG ACAGGTTTTT CGGTTGACTA CCTGGGGGGA GTCGCACGGA 
CCGGCAGTGG GGTGCGTGGT GGATGGATGC CCCGCAGGTC TCGACATCTC GGAAGACTAT
ATTCAGCATG AACTGAATCG CCGACGGGTC GGGCAGAGCC GGGTGACATC GGCGCGTCAA
GAATCCGACC AGGTGCAGAT TCTGTCTGGC GTCTTCGAGG GCCGCGCGAC CGGCGCGCCC
ATCAGTATGC TGGTGTTCAA CACCGATGCG AAGCCGGGGC ACTACGAAAA TATCAAAGAC
CTCTACCGCC CCGGTCATGC CGATTACACC TGGGATGTCA AATATGGCTT CCGCGACTGG
CGTGGTGGCG GGCGTAGCAG CGCGCGCGAG ACGATAGGGC GCGTTGCCGG CGGCGCAGTT
GCGAAACGCC TCCTGGCGCA GCACGGCGTA TCGATTATTG CCTGGACGGC ACAACTCGGC
GATCTGAAGG CCGAGGTGAT CGACGAGAGC GAAATCGAGC GTAATATCAT GCGCTGCCCG
GATGCGCGCG TCGCAGCGCT GATGGTCGAG CGGGTCGAAC AGGCGCGCCG CAGCCTTGAC
TCGCTCGGTG GTATCGTCGA AGTGCGAGCG CGCGGCGTTC CCCCCGGTCT CGGCGAACCG
GTCTTCGACA AACTTCAGGC GGATATTGGC AAGGCAATGT TCAGCATCCC GGCAATTAAG
GGCGTTGAGT TCGGCGAAGG GTTCGGTGTA GCGCATATGA CCGGCTCTGT CCACAATGAT
CCTTTCGAGC GTCGCGCCGA TGGCACAATT GGAACATCGT CCAACCACCA CGGTGGCATT
CTCGGCGGGA TCAGCACCGG TGAAGAAATT GTGCTGCGCA TTGCTGCCAA GCCTCCCGCT
TCGATTGCTC GACTGCAACG CACCGTTGAC CGTGAGGGAA ATCCGACGGA GATCGAAATC
CACGGGCGCC ACGACCCAAC GGTGCTCCCG CGGTTGGTTC CAATCGCCGA GGCGATGCTG
GCGTTGGTGC TCGCCGATCA TCTGCTGCGT CAGCGCCTGG CACGGATGGA GAGATGA
 
Protein sequence
MPGNTFGQVF RLTTWGESHG PAVGCVVDGC PAGLDISEDY IQHELNRRRV GQSRVTSARQ 
ESDQVQILSG VFEGRATGAP ISMLVFNTDA KPGHYENIKD LYRPGHADYT WDVKYGFRDW
RGGGRSSARE TIGRVAGGAV AKRLLAQHGV SIIAWTAQLG DLKAEVIDES EIERNIMRCP
DARVAALMVE RVEQARRSLD SLGGIVEVRA RGVPPGLGEP VFDKLQADIG KAMFSIPAIK
GVEFGEGFGV AHMTGSVHND PFERRADGTI GTSSNHHGGI LGGISTGEEI VLRIAAKPPA
SIARLQRTVD REGNPTEIEI HGRHDPTVLP RLVPIAEAML ALVLADHLLR QRLARMER
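
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only: it assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two tables differ in permitted start codons, which this sketch does not handle), and the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_line = "ATGCCTGGAA ATACCTTTGG ACAGGTTTTT CGGTTGACTA CCTGGGGGGA GTCGCACGGA"
    print(translate(first_line))  # MPGNTFGQVFRLTTWGESHG
```

The first 60 bases of the gene sequence translate to the first 20 residues of the protein sequence listed above.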