Gene RoseRS_0444 details


Gene Information

Locus tag: RoseRS_0444
Symbol:
ID: 5207380
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 563079
End bp: 564197
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 60%
IMG OID: 640594065
Product: transposase, IS4 family protein
Protein accession: YP_001274820
Protein GI: 148654615
COG category:
COG ID:
TIGRFAM ID:
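
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 563079
end_bp = 564197
reported_gene_length = 1119      # bp
reported_protein_length = 372    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and does not encode a residue.
protein_length = gene_length // 3 - 1

print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
print(gene_length % 3 == 0)  # a complete CDS should be a whole number of codons
```

Under these assumptions the reported gene length and protein length agree with the coordinates.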


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000119461
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0439577
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGACA CGTACCGCCG GTATCGTGCC ATAGCTCAGT GTTTGCTGCA ACTCTATCCC 
CAGGTCGGTG GGCATCAACG GCGCCATCTG GCGACCTTGG CGCTCTTGAT CTGCGGGATT
GTCGGCAGCC AACACACCCA ATTGCCAAAA GTGGTTGAAC GGACGCCTGG CGGACGCGCC
GCCGACGAGA GTGTCGTGAT GCGTTTTCGA CGCTGGCTCA AACACGACAA CGTAACCTAC
AAGCGCTGGA TGCTGCCCGT TGCCCAAGCA CTTATCGCCA TGTTGGGGCG TCGACCATTG
GTGTTCGTCA TTGATGGGAG TACCGTTGGG CGGGGATGCA TGTGCCTGAT GATCAGCGTG
TTGTATCAGC GTCGGGCGCT TCCGATCACC TGGCTCGTGG TGAAAGCGCG CAAAGGCCAT
CTGCCAGAAG CACTGCATTG TGCGCTGCTC GAGCAACTCG CTCAGCTCGT TCCGGCCGAG
GCGAGCGTGA CGATCTTGGG GGATGGTGAA TATGATGGCG CCGATTGGCA AGCCGCGATT
ACTGCGCGCG GGTGGAAGTA TGTCTGCCGA ACCGCAAGCA ATATCCTGCT GACGCTGGCG
GAGGCGACTA TTGCTCTTGG CGATCTCGCG CCGAAGCGTG GCGAGGTTAT CGCCGTCGAG
CAGGTCTGCA TAACGGCCGC ACAGTACGGT CCGGTTAACG TGCTGGCGGT GTGGGAAGCG
GCCTACGAGC ATCCAATCCA TCTGGTGACG ACGCACGCTG ACGTGGCGTA TGCCTTGGCC
TTGTATCGCC GCCGTGCGCA GATCGAAACC TACTTCTCGG ATCAGAAGAG TCGCGGCTTT
CGGATCAACC GTAGCCATAT CAGTGATCCG ACACGACTTG CGCGCCTGTT GATCGCGACC
GCGCTGGCGT ATCTGTGGGT CGTCTATCTG GGCGTGGTGG CGAGACGGGA TGCGCTGCGT
GGGCGCATCC ATCGACCGGA TCGCTGCGAT CTCAGCTTGT TCTCGCTTGG CTTGCGGCTG
CTAGCCTACT GTCTGCGCCA TCGACGAACC ATCCCGCGCG GGTTGCCCAA ACCACTCTTC
ACGGCATGTC AAACCGCGTT CATATGTTCT GTACGGTAG
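
A GC content figure like the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper that strips the spaces and line breaks used in the listing and returns the fraction of G and C bases. Only the first row of the gene sequence is used here for brevity.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, copied with its 10-base grouping.
first_row = "ATGCGCGACA CGTACCGCCG GTATCGTGCC ATAGCTCAGT GTTTGCTGCA ACTCTATCCC"
print(f"GC content of first row: {100 * gc_content(first_row):.1f}%")
```

Pasting the full gene sequence into `gc_content` gives the value to compare against the reported GC content.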
 
Protein sequence
MRDTYRRYRA IAQCLLQLYP QVGGHQRRHL ATLALLICGI VGSQHTQLPK VVERTPGGRA 
ADESVVMRFR RWLKHDNVTY KRWMLPVAQA LIAMLGRRPL VFVIDGSTVG RGCMCLMISV
LYQRRALPIT WLVVKARKGH LPEALHCALL EQLAQLVPAE ASVTILGDGE YDGADWQAAI
TARGWKYVCR TASNILLTLA EATIALGDLA PKRGEVIAVE QVCITAAQYG PVNVLAVWEA
AYEHPIHLVT THADVAYALA LYRRRAQIET YFSDQKSRGF RINRSHISDP TRLARLLIAT
ALAYLWVVYL GVVARRDALR GRIHRPDRCD LSLFSLGLRL LAYCLRHRRT IPRGLPKPLF
TACQTAFICS VR
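
The protein sequence can be checked against the gene sequence by translating codon by codon. This is a minimal sketch using only the Python standard library. It assumes the standard codon-to-amino-acid assignments, which, to the best of my knowledge, translation table 11 shares with the standard code (the two differ in which start codons are permitted). `translate` and `CODON_TABLE` are illustrative names, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence.
print(translate("ATGCGCGACA CGTACCGCCG GTATCGTGCC"))  # MRDTYRRYRA
print(translate("GTACGGTAG"))                         # VR
```

The two outputs match the first ten and last two residues of the protein sequence listed here; the terminal TAG codon is a stop and contributes no residue.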