Gene RoseRS_1903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_1903 
Symbol 
ID5208864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp2361412 
End bp2362713 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content61% 
IMG OID640595512 
ProductPUCC protein 
Protein accessionYP_001276242 
Protein GI148656037 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00453505 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGCTGT TCAAGAACAT TCGTCTGGGA TTGCTGCACG TTGCGATTGC GATGACGTTC 
GTGCTGATCA ACAGCGTGCT GAACCGGATC ATGATCCACG ATCTGAACAT TCTGGCGAGC
ATTGTTGCCG TGCTGGTCGT GTTGCCCTAC GTGCTGTCGC CAGCGCAGGT CTGGATCGGG
CAATACTCCG ACACCCACCC GATCTTCGGG TACCGACGCA CGCCGTATAT CGCACTGGGG
ACATTGCTCG CCATCACAGG CGCTGCCCTG GCGCCGCACG CCGCGCTGGC GCTGGCGCGG
GAGCCGCTGA TCGGCGTGCC GCTGGCGGTT CTGCTGTTTG GGATGTGGGG CGTTGGATAC
AATCTCGCAG TGGTCGCCTA CCTGTCGCTC GCCAGCGATA TGTCCACTGA GCAGCAGCGT
TCGCGGACTG TTGCGATCAT GTGGTTCATG ATGATCACCA GCGTCATAGT GACGGCGATT
GTCGTTGGGC GTGCGCTGGA GCCGTACAGC GAAGAGCGTC TCTTCACCGT CTTTCTGGAG
ACAGGCGGCG TGGCGCTGGC GCTGGCGCTT GTGGGGTTGA TCGGTCTCGA GCCGCGCCGC
ACAACGGCGA CCGTGCAGCA GAGCCGCGCC GGGCAGATGG CAGCCATCCG CGCCATTATC
GGCAATCCGC AGGCACGTTT CTTTTTCGTC TATCTCATCA TGCTGCTGGC GGCGATCCTG
GGGCAGGATG TTCTGCTCGA GCCGTTTGGC GCGCAGGCAT TCGGAATGAA TGTCAAAGAA
ACGACGCAAC TGACCGCGAT GTGGGGCGGC GCCACATTGA CGGCATTACT GCTGTACGGT
GCGGTGCTCA GTCGCTGGAT CAGCAAGAAG CGCGGCGCGA TGATCGGCGG TTCGATTGCC
GCAACCGGCT TCCTGCTGAT TGCGCTGAGC GGCATGCTCG CCATCGAAGC CATGTTCATC
CCTGGAATCC TGCTCCTTGG TTTCGGCACC GGCATTGCCA CCACGACCAA CCTGGCGCTG
ATGCTCGATA TGACAACAGC CGAGCAGGTC GGCTTGTTCA TCGGTGCGTG GGGTGTGGCA
GATGCAATCG CCCGTGGCGT CGGCACGTTG CTTGGCGGCG TGATGCGCGA TGTCATTGCC
CATATGAGCG GCAGCGCCGT CAGCGGCTAT GTCAGCGTCT TCCTGATCGA GGCAATGCTG
CTGGGCATTT CTCTGGTATT ATTACAGCGA ATCGATGTGA CCGCCTTCCG CAGCCGCCAA
CCGTCGCTGA CCGAACTGGT TGCGATCACT GGCGATGCCT GA
 
Protein sequence
MTLFKNIRLG LLHVAIAMTF VLINSVLNRI MIHDLNILAS IVAVLVVLPY VLSPAQVWIG 
QYSDTHPIFG YRRTPYIALG TLLAITGAAL APHAALALAR EPLIGVPLAV LLFGMWGVGY
NLAVVAYLSL ASDMSTEQQR SRTVAIMWFM MITSVIVTAI VVGRALEPYS EERLFTVFLE
TGGVALALAL VGLIGLEPRR TTATVQQSRA GQMAAIRAII GNPQARFFFV YLIMLLAAIL
GQDVLLEPFG AQAFGMNVKE TTQLTAMWGG ATLTALLLYG AVLSRWISKK RGAMIGGSIA
ATGFLLIALS GMLAIEAMFI PGILLLGFGT GIATTTNLAL MLDMTTAEQV GLFIGAWGVA
DAIARGVGTL LGGVMRDVIA HMSGSAVSGY VSVFLIEAML LGISLVLLQR IDVTAFRSRQ
PSLTELVAIT GDA