Gene RoseRS_1563 details


Gene Information

Locus tag: RoseRS_1563
Symbol:
ID: 5208518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 1911049
End bp: 1912158
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 59%
IMG OID: 640595169
Product: aminodeoxychorismate lyase
Protein accession: YP_001275905
Protein GI: 148655700
COG category: [R] General function prediction only
COG ID: [COG1559] Predicted periplasmic solute-binding protein
TIGRFAM ID: [TIGR00247] conserved hypothetical protein, YceG family
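
The coordinate and length fields above agree with one another, which a short check can illustrate. This is only a sketch: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1911049
end_bp = 1912158

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1110 369
```

Under these assumptions the computed values match the listed Gene Length (1110 bp) and Protein Length (369 aa).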


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGATA CCATTCAGCG AACCTCATTC GCCAAAACGC TGCGTGCGAT CTTTCTCGGC 
GTCGCGCTGC TGGCGCTCAG CGTTGCCTGC GCTGGCTACC TTCTCCTCAG CGAAATCCGT
CGCCCGGCAG GGAATGATGC AACGCCGGTC GAATTCATTG TTGAACCGGG TGATAGCGCC
AGTGTCATCG CTACCCGCCT TGGAACGGCA AACCTGATTC GCCAACCGCT GCTGTTTACC
CTTCTGGTGC GTATGCAAGG TCTCGACAGC GAATTACAGG CCGGTCGCTA TCTGCTGCGT
GCCAACATGA CCATGAGCGA AATCATCGCA GCCTTGCAAA ACAGTCGGGT TGAAGAAGTG
CAGGTGACGA TCATCGAAGG CTCGCGGCTC GAAGAAATCG CCGAGCAGAT CGCCGCAGCC
GGGCTGGTCA ATGTGACCGA GCAGGCATTT CTGCGCACGG CGCGCAACGG TGCAGCGTTT
CAACCGCAGC ACTTCTACCT CAACAGCCTC CCGCCCGGCG CAAGCCTGGA AGGGTATCTG
TTTCCCGATA CCTATCGCTT CGCGGTGACT GCCACGGTCA CCGAGGTGAT CGAAATCATG
CTCGACCGCT TTGATGAGCA GTACGCAACA TTCGAACGCG AGGTGACGGT GAAGGGCGCT
ACGGTTCACG ATATTGTGAC CATGGCGTCG ATTGTGCAGC GTGAGGCGGC GCGTGAAGAT
GAGATGCCCA AAATCGCCGC TGTGTTCTGG AATCGCCTCA AGCCGGAACA TCTTGCCGAA
ACGGGGGGCG GCAAACTGGG CGCCGACCCA ACCGTGCAGT ACATTCTGGG GCAGCGCGGC
AACTGGTGGC CCCGTCTCGA TTCGCTCAGT ATCGATGAAA TCAACGGTAT TGCCAGCCCG
TACAACACAC GTGTCAATCC AGGGTTGCCG CCCGGACCGA TCGCCAGTCC GGGGCTTGCG
GCGCTACGCG CTGCTGCCAG ACCCGACACA TCGGCGCCAT ACCTCTACTT TGTCGCTTCC
TGCACCACGC CCGGCGCACA CAATTTTGCC GTTACGTTTG AGGAGTTTCA GCGCTTCGAG
CGGGAGTACC TGACATGTCC GTCGCGTTAG
 
Protein sequence
MSDTIQRTSF AKTLRAIFLG VALLALSVAC AGYLLLSEIR RPAGNDATPV EFIVEPGDSA 
SVIATRLGTA NLIRQPLLFT LLVRMQGLDS ELQAGRYLLR ANMTMSEIIA ALQNSRVEEV
QVTIIEGSRL EEIAEQIAAA GLVNVTEQAF LRTARNGAAF QPQHFYLNSL PPGASLEGYL
FPDTYRFAVT ATVTEVIEIM LDRFDEQYAT FEREVTVKGA TVHDIVTMAS IVQREAARED
EMPKIAAVFW NRLKPEHLAE TGGGKLGADP TVQYILGQRG NWWPRLDSLS IDEINGIASP
YNTRVNPGLP PGPIASPGLA ALRAAARPDT SAPYLYFVAS CTTPGAHNFA VTFEEFQRFE
REYLTCPSR
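
The gene sequence is printed in space-separated groups of ten bases, so any tool that reads it has to strip whitespace first. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The function name `gc_percent` is hypothetical, and the exact method the database used to derive its 59% figure is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring all whitespace."""
    # Remove spaces and line breaks left over from the 10-base display groups.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base group of the gene sequence: 2 G + 1 C out of 10 bases.
print(gc_percent("ATGTCTGATA"))  # prints: 30.0
```

Pasting the full gene sequence into `gc_percent` as a multi-line string should work without further cleanup, since line breaks are removed along with spaces.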