Gene RoseRS_1021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1021 
Symbol 
ID5207967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp1255944 
End bp1257368 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content68% 
IMG OID640594635 
Producthypothetical protein 
Protein accessionYP_001275380 
Protein GI148655175 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATCT CGCATGCCGA ACTGATCACG CTCACCTGCC CCGCATGCGG CACATCACAC 
ACCGCCGACA TCTGGCTGAT CGTCGCCCCT GACGAACGCC CCGACCTGGT TGAACAGATC
CGTAACGGCA CTCTGCATAC TGCGACATGC CCGCAGTGTG GCCAGACCCG CACCCTCGAC
GCCCCGCTGC TGATCTTCCG CCCCACCGCC GAGCCGCCCA TCCTCTTCGC ACCTGCGCAG
CAAACAAGCA CAGAGCAAGA TCAACAACAT GCCGCAGAAT TGATCGGCAT CCTGCGCCAG
CGCCTGGGCG CAGCGTGGAA CGACGCCTGG CTTGGCCAGG GACTGACAGT CGTGCCGCGC
CAGATACTGC CGCTGGCGCT GGCCGACGAT CCGGCAGCAG CGTTGCAGGA GCTGGCCGTC
CAGATGCAGC AGGCGCTCGC CGAACTGCGC CGGCGCGACC CGGAGGCGTT CGCCCGGCTC
GAGGCAGAAG CGCAGCAGGC AATGGCAGCA TTGCTGGCAG CCGACCAGAC GCCGGATGAA
GCGCCAGCGA GCGCCGCGAC GACCGACGCG CCTGCGTTGA TCCAGGCTTT AGACGCCTTC
CTCAACGCCC GCACCTGGAT CGACAGTTAC CGGCAAGTGC AGGCCCACCC CGAACTGCTG
AGCGACGAGG CGCTGGCGCT GCTCGAGCAG CGCATTGCCG CTGCCCGCAC GGTGGGCAAC
TCCCGCGCCG TCGCCTTCTT CGAGGAACAC CTGTCTCTGC TGCGGCGTTG CCGCGAGGTC
GGCATCCCAC GCGCCTTCGC CGCGAAGATG CTACCGCCCG AGACGCTGGC GCAGGCTGAG
GCGGCCGGGC TGACGCCGGC GCAGGCGATC GAGGCTGCGG CGCAGGCGAT GGACATAGGC
AGCGGCGTGG ATGTCCCGTC CCGATTCCGT AATGATCTGC AGCAAGCCAA GGAAGCCGAG
CAGCGCTACC GGCGCACAGG CGACCGTGCC GCGTTGGATG CCGCGGCTGC GGCCTGGCAG
CGGGTGCTCG ATGATCCCGC TTTTGCGCGT AGCCAGGAAC GCTTTCAACT GGCAGCTATG
AATGATGCAG GCGTTATCTT CTTGCAGCGT TATTGGTCAG TGGGTCACAT TGCCGATTTA
AATCGCGCTA TCGAGTTGTG GCAACGGGCG GTCGAGCTCA CCCCGCCCGA CTCCCCCGAC
CGCCCCGCTC GGCTGAACAA CCTGGGGACC GGGCTGCGCG CCCGCTATGC CCGCAGCGGG
CGGCTGGAGG ACCTGGATGC GGCCATTGCC GCCTGGCAGC AGGCGCTGGA TGCCACCCCG
CCCGACTCCC CCGCCCGACT CCCCCGACCG CCCCGCTCGG CTGAACAACC TGGGGACCGG
GCTGCGCGAC CGCTATGCCC GCAGCGGGCG GCTGGAGGAG CTTAA
 
Protein sequence
MPISHAELIT LTCPACGTSH TADIWLIVAP DERPDLVEQI RNGTLHTATC PQCGQTRTLD 
APLLIFRPTA EPPILFAPAQ QTSTEQDQQH AAELIGILRQ RLGAAWNDAW LGQGLTVVPR
QILPLALADD PAAALQELAV QMQQALAELR RRDPEAFARL EAEAQQAMAA LLAADQTPDE
APASAATTDA PALIQALDAF LNARTWIDSY RQVQAHPELL SDEALALLEQ RIAAARTVGN
SRAVAFFEEH LSLLRRCREV GIPRAFAAKM LPPETLAQAE AAGLTPAQAI EAAAQAMDIG
SGVDVPSRFR NDLQQAKEAE QRYRRTGDRA ALDAAAAAWQ RVLDDPAFAR SQERFQLAAM
NDAGVIFLQR YWSVGHIADL NRAIELWQRA VELTPPDSPD RPARLNNLGT GLRARYARSG
RLEDLDAAIA AWQQALDATP PDSPARLPRP PRSAEQPGDR AARPLCPQRA AGGA