Gene Rcas_4216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4216 
Symbol 
ID5541727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5453613 
End bp5455073 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content65% 
IMG OID640896323 
Productisochorismate synthase 
Protein accessionYP_001434261 
Protein GI156744132 
COG category[H] Coenzyme transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1169] Isochorismate synthase 
TIGRFAM ID[TIGR00543] isochorismate synthases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.192209 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.410479 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAAC CCTTCCAACA ACACTATGGT CGCCTGATCA GCCTGAGCAT GCCGTGCCCT 
GGCGTGTCTC CCGCCGATCT GTTGCGCCAT GCGCGCGGGC AGCCGCGATC ATTCTGGGAG
AGCGCCCGCG ATGGGGTGGC GTTCGCCGGG ATGGGGATCG CGGTCGAACT GATGGCGTGG
GGCGCCAATC GTTTTGTCGA AATCGAGCAG CAGGCGCGCG CGCTGTTCGA GAACGCCGTC
ATGCTCGATG AGCGTGAGCC GCTGGCGGCG CCACGTCTCT TTGGCGGCTT TGCGTTCCAC
AACGATTTCG TGCCCGATCT GGCATGGGCT GATTTCCCGC CAGCACATTT TGTGCTGCCA
CACTACCAAC TGGTGCGCGT TCGCGATTCG TTCTGGTTGA CGCTGAACGT CCACGCGCCA
CCTGGCGAGG ACCCGCGCGC GCTAGCGCCC GACTTGCGCG AGGCGCTGCT GGCGCAGGTC
GATGCGCTTC AGAGCGAGCC GCCGCCACTG CCGCCGCGTT CGTCGGCGCG CCTTGCATAT
CCCATGCCGT TCGAGCAGTG GGCGCGCAGT GTGGAACGGA TTGTCCGGCA GATCAACGTC
GGTGAATTGA AGAAGGTCGT GCTGGCGCGG ATTGCCGAGG CATCGTTCGA CGCGCCGGTG
GATGTCGATA GCGCCCTGGC GTGTCTGGCA CAGCGCTACC CCGACACGTA TCGCTTTCTC
TTTGAGCCGC GTCCGGGGCG CGCATTCTTC GGCGCAACGC CGGAATTGCT GGCGCAGGTG
AACGGCGACC GGGTGACGAC AATGGCGCTG GCAGGCAGCA TCCGGCGTGG CGCAACACCC
GATGAAGATG AGCGTCTTGC TTTGGCGCTG CTCGATAGCG CAAAGGATCG CCACGAGCAT
CAGATTGTGG TCGATGAGGT GCGCAATCGT CTGGCGTCGC TGACCAGGCG CCTGGATGTG
GGAGCAACTG ATGTGATGCG GTTGAGCAAT ATTCAGCACC TGCACACGCC AATCAGTGGC
GTACTGCGCG AGCCACGCGG CATTCTGCCG ATCATTGCGA CGCTCCACCC AACGCCTGCG
CTCGGCGGTG AGCCGCGCGC GGCGGCGATG CGCCTGATCG CCGAACTGGA ACCGGCGCCG
CGTGGCTGGT ATGCTGCGCC CGTCGGCTGG ATCGACCGGC GCCTGGATGG GCAGTTCGGG
GTTGCCATTC GCTCGGCAGT GGTACAGGCG ACCCGCGCCT GGTTGTACGC CGGCGCCGGT
ATCGTTGCCG CAAGCGATCC GCAACGCGAG TGGGACGAAA CGAACCTTAA GTTCCGTCCG
ATGCTCGAGG GGTTGGGGCA CACGGATCGC CACGGGTTGG GGCACCTGGT AGGGACACGG
ATCGCCACGG ATACGACGGA TCGCCACGGG TTGGGGCACA CGGATCGCCA CGGATACGAC
GGATCGCCAC AGATTAAGTA A
 
Protein sequence
MTKPFQQHYG RLISLSMPCP GVSPADLLRH ARGQPRSFWE SARDGVAFAG MGIAVELMAW 
GANRFVEIEQ QARALFENAV MLDEREPLAA PRLFGGFAFH NDFVPDLAWA DFPPAHFVLP
HYQLVRVRDS FWLTLNVHAP PGEDPRALAP DLREALLAQV DALQSEPPPL PPRSSARLAY
PMPFEQWARS VERIVRQINV GELKKVVLAR IAEASFDAPV DVDSALACLA QRYPDTYRFL
FEPRPGRAFF GATPELLAQV NGDRVTTMAL AGSIRRGATP DEDERLALAL LDSAKDRHEH
QIVVDEVRNR LASLTRRLDV GATDVMRLSN IQHLHTPISG VLREPRGILP IIATLHPTPA
LGGEPRAAAM RLIAELEPAP RGWYAAPVGW IDRRLDGQFG VAIRSAVVQA TRAWLYAGAG
IVAASDPQRE WDETNLKFRP MLEGLGHTDR HGLGHLVGTR IATDTTDRHG LGHTDRHGYD
GSPQIK