Gene RoseRS_3152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3152 
Symbol 
ID5210122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3965354 
End bp3966994 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content63% 
IMG OID640596743 
ProductO-antigen polymerase 
Protein accessionYP_001277463 
Protein GI148657258 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAGT TGAACGCGGT TCCGTTTTTG ATGCCACTGA TACTGTTTGT CACCGGCGGT 
CTCCTGGGAA CCCTCGTCGC CTGCGATCCG GAGCAGAGCC TGCCGTGGAT GCTGACGCTG
ACACTGGGAA TAATACTCTA TCTGGGGATT GTGACGTTCC TGCGCAACCG ATTGATGAGC
GTCGCTGCGG GAATCGGCGC CTGGAGCATC AGTTACGGTG TGCTGCTGGC GGTGCAGTAT
CGCCATCTTG GTTTTCACGA AAAGTTGGGG CTGTCCGCCT GGCTGGGGCG ACTGACCAGT
GCACCTTTCC CAGATGTGAC GCCGGTGTTC ATCGATGCAA ATGCAGCGGC ATCGTTTCTT
GCACCAGCGC TCCCGCTGAT CGTCGGCATA GCGCTGAGCG TCCATGGCGC ACAACGCAAA
GCCTGGAGCG TCGGTGCCGC TCTGGTGGCG TTCGGCGTGC TGCTGACGTC TTCGCGTGGT
GCGTTTGTTG CGCTGATCGC CGCTGGCTTG CTCTGGGGGT TGGTGCGCCT CCAGAGATAT
GCGCAGGATT CCGACGCACG CATCCCACGC TCTATCTGGC GCAGCCTGGT TGTTGCCGGC
GCGGCTGGCA TCGGTCTCAT CGCGCTGCTG GTCAACCTCC CCGCGACGCA GGATGCACTT
GCGTCCGCTG CGCTGCGTGC CGCAGACCGC CTGGCAGTCT ACCGCAACAG TTTCTTTCTG
GCGCTCGACT TTCCGTTCAG CGGCATCGGA CCGGGCGCCT TCGGTCCGAT GTATTCGCGC
TTTCAGTTGC TCATTCTTCC CACGTTCATC AGTTATGCGC ACAACCTGTT TCTCGGCGTC
TGGCTGGCGC AAGGCATTGT CGGGCTGATC GGCTTCGCCT GGCTGCTCAT CGCTTCGTTC
CGGCGAACTG CACCCATACT GCACGCACAA ACGCCGCTGG TTCAGGGCGC AGCAATCGGG
TGCGTGGTTC TACTGATCCA CGGGCTTTCC GATGCGCCAC AGTACGACAC GTCCTGGACC
ACGATGCTGC TGGCGTTCGG TCTCTTCAGC ATCGTGGCTG CGGCACCGCA CCAACCGGTT
GACATCCCCG CGCGCGCCGC CCGGCGACCC AATCCACACC GTCGGAGGCA GGTCGGGATC
ACGATTGCTG CGGTCGCGCT GGTTCTGATC GTGAGTGGAC CGCATCTCGC GGCTGCTGGC
GCCGTGAACT ATGCCGCCTG GCTCCAATCG CGCGCGATGC TTGCCGACGA GTTGACGCAG
AAGGAACGCA CCACGCTGAT GCACGACGCT GTGACGTGGA TCAACTACGG TTTGCAGATC
GCCCCCGCTT CGCCGCTGGT TCAGAAACGG CTGGGCATGC TGGCGCTCGA TCTGGGGGAT
TATCCACGGG CGATCAATGC GCTCGAACAG GCGCAGACAT GGTTCGCCGC CGATCAGGCG
ACCCACAAGG CGCTCGGCAT GGCGTATGTG TGGCATGGCG AACCGGATCG GGGCGCCAGG
ATGCTGGCGC ACCTCGATCA GGCGTCCGAA GTGCGCGAAG AACTCGCTAT CTGGGTCTAT
GCCTGGCGCG AGCGGGGACG GGACGACCTT GCCGCGTATG CGAAACGCGC AGCCGAAGTA
ATGACGGAAG CGCCGCGTTG A
 
Protein sequence
MKQLNAVPFL MPLILFVTGG LLGTLVACDP EQSLPWMLTL TLGIILYLGI VTFLRNRLMS 
VAAGIGAWSI SYGVLLAVQY RHLGFHEKLG LSAWLGRLTS APFPDVTPVF IDANAAASFL
APALPLIVGI ALSVHGAQRK AWSVGAALVA FGVLLTSSRG AFVALIAAGL LWGLVRLQRY
AQDSDARIPR SIWRSLVVAG AAGIGLIALL VNLPATQDAL ASAALRAADR LAVYRNSFFL
ALDFPFSGIG PGAFGPMYSR FQLLILPTFI SYAHNLFLGV WLAQGIVGLI GFAWLLIASF
RRTAPILHAQ TPLVQGAAIG CVVLLIHGLS DAPQYDTSWT TMLLAFGLFS IVAAAPHQPV
DIPARAARRP NPHRRRQVGI TIAAVALVLI VSGPHLAAAG AVNYAAWLQS RAMLADELTQ
KERTTLMHDA VTWINYGLQI APASPLVQKR LGMLALDLGD YPRAINALEQ AQTWFAADQA
THKALGMAYV WHGEPDRGAR MLAHLDQASE VREELAIWVY AWRERGRDDL AAYAKRAAEV
MTEAPR