Gene RoseRS_3072 details

Gene Information

Locus tag: RoseRS_3072
Symbol:
ID: 5210040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 3859222
End bp: 3860760
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 58%
IMG OID: 640596663
Product: hypothetical protein
Protein accession: YP_001277385
Protein GI: 148657180
COG category: [R] General function prediction only
COG ID: [COG3042] Putative hemolysin
TIGRFAM ID:
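
The coordinate and length fields above are mutually consistent, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3859222
end_bp = 3860760

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not
# counted in the reported protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1539 512
```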


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00570768
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00425561
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTATGCAT CCTCACGTCG ATTGCAGTGG CGGTTGACCC TGGTCGTGAT GGTTGTGTTC 
GCGCTGCTGA GCAGTGGCTG CTCACCCTTC CCCAGCCAGC CTGCCACACT GACACCCCCT
CAGCGCGTGA CGGAATCGCC GCCGGTGGTC AGCCCGACAC CGGTTCCAGC GACAACACCT
CCCGTCACGC CAACCCCGCT GGCGGCGCTC ACCGACCCGG CTTCGCTCTA CTGCATCGAG
AAAGGTGGAC GGTTGGAGAT CCGCTCCAGT GGCAACAGCA GTCATATTGG TGTCTGTATC
CTTGATGATG GCACTCTGTG TGAACAATGG TCGTTCTATC GCGGCGAATG CCAGTCTGAT
CAAACCTACA ACCTGCCGCC TGCGCCGCCA TCGGCGGATG ATAAGGTCTT TGCAGCGTTG
TTTGCGGACG TGCGCCGCCG GTTGCCGCCT GAGGCCTTCA CCGATTTTGC CGCTCAACCG
ATGCGCAGCG ATGATGGACG GCAGTGGTGG GTGGTCTACA GTCTCGACAT GCGCAATTTT
GGACTCGATG AGAGGGCAGC GCATTTCGTT GCGATCTACT CCTATGACGA CAACCGATGG
CAGGAACGAG CGCGAGTAAC GCTTGTCAAA CAACAGACCG GGGTCAACCT GGAACCGGCT
TATCTCGAGG AGGTTCGACA GGTGGCTATC GCGCCCGGTA AACTATGGAT TCAGGTTGAG
GGCGGTCTGG GTCTGCATAG CGGGAGTTAT CACCTGTTGA GTTTTGATGG CGTCACCCTG
CAACCAGAAG TGGTTGCGTT TTCGTCATCG CCCAATGTCG GTTTTGTGAC CGATCTCAAC
GGTGACGGTC TCAATGAAGT GGTGCTCAGG CGGTTTGAGT ACTATATTTT TTGCTACGCC
TGCGAGGTCT ATTACCCATT CTGTGAGGTA TATACCTGGC AGAATGATGA CCTGGTTTTA
CTCGCAATCT CCGATCTGAC CGCCGGGTAT CAGGGGTCTC CCTTTGCCGA ACTCAACCGG
AAAGCGGTTG CTTTCGTGCA GGCTGATCTG TGGGCTGAGG CTCTGCGCGC GATCAATGAT
GCCGTGGCGC AGGCCGGTGC AACCGATCCG CCGACAACCG CCGGTTCATT ACGCTGGAAT
CAGCGCCTGA TCCGGATGAT CCACGATGCA CACCGTGAAG CTATTGAAAC CAGCGCCTAC
CCGCTCATCA ACGAGGTGTT CTATGGCGAC TATGCGCGTG CAGTAGAGCA TATGCGCGCG
TATCAGGCAG CCGATCTGTT CCGCATCGAT TCGCCGCTGA TCGTCGGGAC GGTCGCCGAG
GGGTGGACAG AAAATCTTGG CGAGTATCTG GTTAATCATA CCGGGCGCGC CCTGGCGGTT
GTTCCGGATC GCGCCGAGAT CTATTTTGTC GGGGCATGGG GCAAGTTTCT TGTCAATCCT
GACGATCCGG CCATCGGCGC CGATCTCGAA CGCGCTGCAC GGTTGCAGCC GAACGATCAG
TTCTTTGCCG ATGCCTTCGC CTGGTGGCAG GATCGTTAG
 
Protein sequence
MYASSRRLQW RLTLVVMVVF ALLSSGCSPF PSQPATLTPP QRVTESPPVV SPTPVPATTP 
PVTPTPLAAL TDPASLYCIE KGGRLEIRSS GNSSHIGVCI LDDGTLCEQW SFYRGECQSD
QTYNLPPAPP SADDKVFAAL FADVRRRLPP EAFTDFAAQP MRSDDGRQWW VVYSLDMRNF
GLDERAAHFV AIYSYDDNRW QERARVTLVK QQTGVNLEPA YLEEVRQVAI APGKLWIQVE
GGLGLHSGSY HLLSFDGVTL QPEVVAFSSS PNVGFVTDLN GDGLNEVVLR RFEYYIFCYA
CEVYYPFCEV YTWQNDDLVL LAISDLTAGY QGSPFAELNR KAVAFVQADL WAEALRAIND
AVAQAGATDP PTTAGSLRWN QRLIRMIHDA HREAIETSAY PLINEVFYGD YARAVEHMRA
YQAADLFRID SPLIVGTVAE GWTENLGEYL VNHTGRALAV VPDRAEIYFV GAWGKFLVNP
DDPAIGADLE RAARLQPNDQ FFADAFAWWQ DR
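
The protein sequence can be related back to the gene sequence by codon-wise translation. The following is a minimal sketch using only the Python standard library, not the tool that produced this record. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons, which this sketch does not model), and the `translate` helper and variable names are hypothetical.

```python
from itertools import product

# Codon table in the conventional compact ordering: bases T, C, A, G,
# with the third codon position varying fastest. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the
    sequence listing can be pasted in directly.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTATGCAT CCTCACGTCG ATTGCAGTGG"))  # MYASSRRLQW

# Last 9 bases of the gene sequence, ending in the TAG stop codon.
print(translate("GATCGTTAG"))  # DR
```

Passing the full gene sequence to `translate` in the same way allows the result to be compared against the listed protein sequence.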