Gene RoseRS_2010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2010 
Symbol 
ID5208972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2498416 
End bp2500002 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content69% 
IMG OID640595617 
Producthypothetical protein 
Protein accessionYP_001276346 
Protein GI148656141 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00501978 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTGAGA TGCCACGCTC CCATGACCTC AAAGTCGAAC CGCGCGCGCT GGCGCGCTTC 
ATCCCGCGCA CGGTTCCGCC TGCGTTGCGC CTGCCGCCGG AGATCAACCG CCTCGACCTG
CTGCACCAGG AGGGCGTCGA GGCGCTGCTG CGCGAACTCT ACAATCTGCT GCGCGACCAG
GGCATCACCT GTGATGTTGA GCCGCCCGCC CCGGCGATGG CGGTGCGCCA GCCGATCCGC
ACCATCGACA CGACTCTGAC TGAGAAGCGC GGTACGTGTC TCGACCTCTC GCTCCTCTTC
TGCGCCGTCT GCCTGGCGCA CGACCTGGCG CCGCTGCTGA TTGTGCTGGA GGGGCATGCG
TTTGTTGCTG TTACGACCGG ACGAACGCTC CAACAACCGC ATGGCGAAGG CATGCCGGCG
TTCGAGCGAG GCATGCTCGC CAGTTTCGAC GCACTGCGCG ACCAGGTTCC CCGGCTCTAT
CTGCCGGTCG AGTGTACCGG TTTCGCTGCC GGGGCCGGTC TGAGCCGGGA GTATCCCGAA
GGGCGCGGGC GCGAACGGCG CGATGGGTGC ATGTCGTTTG ATCGCGCCGT GCGGGCCGGC
GAGGAATACC TCAATGCCCA CATCGCGCCC GCCGGCGCGA CCCCCGGCGC AGCGCAGCGC
GCCTTCCTGT ACGCGCTCGA TATTGTAACG TTGCAGGATC GGTACGGCTT CATGCCGGTC
GGCGACACGC TGCCGGGAGA TACGCGGGTG TACCTGGACT CGGCGCACGC CGAAGGCGGC
GCTGCCAGCG TTCGCAATCA GGGAGCGGGC GAAGCGGCGC CGTCCGCGCT GCCTGATGCC
GACCGGCTCT ACCAGCGTTC GGCGCATGCC GAAGGCGGCG GTGATGCGCG GGTGGAGAAC
CAGGGCGGCA GCGTTGCACC GCCACCGGCA AAGGCACGTC GCATCTACAG CGGTTCGGCG
CACGCCGAAG GCGGCGGTAA TGCCACCGTG CTCAACAACG CGCCTCAACC TGTGGCGCGC
GTTGAACCGG CGACGGTGCT GGCGGTGTAT GCTGCGCCGC CCGGCAGCGC GCTGCTGCAC
TGGGAACGCG ATGTGCGCGC GCTGGGCAAG GCGCTTGCGC CCTACCCCGA CCGCTTCCGT
CTCGACGTTG TGCCGCTGGC GACGCCGGAA GACGTGCAGC GCGCGCTGGT GCAGTTTCGC
CCGCGCTATC TGCATCTCTT CGCCCATGGC GCAGTTGATG GCATCCTGCT CGATGACGGC
GAGGGCGGGC GTGGCTGGAA ACTCCCTTAC CCGCTGCTGG CGGAAATGGT GCGCGCCACG
CCTGGCCTCC GCTGCGTCCT GCTCAGCGCC TGCGACTCGG CCTATGCCGC GAGTGCAACC
GGCAGCGGCG AGCCGTACCT GATCGCCATG CGTGGCCAGG TCAGCGTCGA TGCCGCGATC
GCCTTTGCCG GGGGGTTCTA TGAAGCCCTC GCCGCGCGCG AGGACACTCC GATTGAAGCG
GCCTTCGCTC AGGGGCTGAT CCGGCTGAAG CTGATTGCGC CGCTGGATGC GGAGGTTCCG
CTGCTGGCGG CGGGCTGGCG CGGGTGA
 
Protein sequence
MPEMPRSHDL KVEPRALARF IPRTVPPALR LPPEINRLDL LHQEGVEALL RELYNLLRDQ 
GITCDVEPPA PAMAVRQPIR TIDTTLTEKR GTCLDLSLLF CAVCLAHDLA PLLIVLEGHA
FVAVTTGRTL QQPHGEGMPA FERGMLASFD ALRDQVPRLY LPVECTGFAA GAGLSREYPE
GRGRERRDGC MSFDRAVRAG EEYLNAHIAP AGATPGAAQR AFLYALDIVT LQDRYGFMPV
GDTLPGDTRV YLDSAHAEGG AASVRNQGAG EAAPSALPDA DRLYQRSAHA EGGGDARVEN
QGGSVAPPPA KARRIYSGSA HAEGGGNATV LNNAPQPVAR VEPATVLAVY AAPPGSALLH
WERDVRALGK ALAPYPDRFR LDVVPLATPE DVQRALVQFR PRYLHLFAHG AVDGILLDDG
EGGRGWKLPY PLLAEMVRAT PGLRCVLLSA CDSAYAASAT GSGEPYLIAM RGQVSVDAAI
AFAGGFYEAL AAREDTPIEA AFAQGLIRLK LIAPLDAEVP LLAAGWRG