Gene RoseRS_4007 details


Gene Information

Locus tag: RoseRS_4007
Symbol:
ID: 5210990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5012044
End bp: 5014974
Gene Length: 2931 bp
Protein Length: 976 aa
Translation table: 11
GC content: 65%
IMG OID: 640597596
Product: hypothetical protein
Protein accession: YP_001278302
Protein GI: 148658097
COG category:
COG ID:
TIGRFAM ID:
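
The start and end coordinates and the two lengths listed above can be checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
start_bp, end_bp = 5012044, 5014974
length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2931
print(protein_length(length_bp))  # 976
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.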


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.109226
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGCG TTGCACTCCG TCCCTCGCCT TCAAACCCGC CGCCGGCGAC CCGTCCGCCG 
CGTCCCGACC TGCCGTTCCA TCTCGTGCGC GCGCTGGCGT TGATCTGCAT TGTGCTGGCG
CTTGTCCAAC CGCTGGCGCT GCCGCAGGTT GCCGACTCGC CTTCATTCCA GGCGATCCTG
TTACCCCGTG TCTTGCACGA GCCAGGAGTC GTTGCGCCAG CATTGCTGCG GCGTGCATTC
GAGCCGCCGC TGTTCCCGAT GATCAGCGCC GTGGCAGTGG CGTCGCTGTG CATCCTTGCG
GGGGAACTCA GCGTGGCGCT GATCGCGGCG ATCCTGGCGC GCCACGCTGC ACGTCGCGCT
ACGCGCTCGA CGATCTGCCT GCGGGTTCGC CCCGCGCATA CCCTCGCCGG GACGAGCGCA
TCGGTCGCCA GACCAGGCGC GCTTATGCGT ATGATTCATG GCGGCATGGC GAAGCGTTTC
TGGATGCATC CTGCTCCACC GTATACCCTG ATCATCAGCG GCGCGCCAGA TCTCCCCGCA
GAACTGGGGG CGCTGATCAG CGGCACGCCA GACGATCAAC AACGCGCTAT GGGTGTGCTG
GACGGCGCAG TACGCAGCAG CGCTCCGGGC GCACAGATCG ATGCCACCGA TGATCCGCTC
ATTGCAGCGG CAACTCCTGG ACGCTGGATC GCCTGGCAGC GCTTCGGTCT GGCGCTGCCG
CCAGCCTATC CGCTCCATAC ACCGGCGCTC GTTGTTGAAC ATGAACTGGC AGGAGTATTG
CTGGCGGCAG TCCGACCCCA CGGCAGCGTA GTGCACGCCG GGCTGGAGAT CGCCCTGCGC
CCGCAGGGTG GCATGACCGG ATGGGCGCTG GGGAGGCAAT GGCGCGCACG TGCGATGGCG
TTGAAACTGA CGCTTGAACA GCGCCAGGAT TATGCGCTTG CGCCGGATAT TGCCGCCATC
GAGGCAAAAC TGGGCGATGC CGCATTCGAG GCGACCATCG TGGCAACCGC CGTCGCCGGG
CAGCGTGATG ACGCAATCAC TGCGCTCCGC GCCATCGGTG ATGCGCTTGG CGCATTCCAG
CAACGCACCG CAAGCCGCTT GCAGCGTCTC GTCCCGCTTG CCCGTGCGTC GGTCCGTCAG
ATAACCGTGG GTCAGGAGAT GCAGATCGTA TCGCACCTGC GCACGCCGCC GGTCTCACAT
CCCCCCACGC TGCTACTGCC GTTCCGTCTC TGGCGTGGAC CGGACATTTT GAGCGCCGGA
GAAGTGGGGT ATCTGTGGAA TCTTTCCAAT CCTCAGTCGA ACGGATTGAT GCGCTGTGAT
CCGTGCCGCC GGATTGCAGC GCCGCCTCAC GCCTTCTGCG CCGACGATCC GGAGCGTATC
ATCGTCGGAT ACGCCGCGCA TACCGATGGA CACAGCGCAC CGGTTGGACC GACGCTGCGC
GATCTGCGTC AGATTCTGCA CCTGACCGCC GGTATGGGCG CCGGAAAGAG TCGTTTGCTG
GCGAACCTGT GTCAGCAACT GATCCCACAC GGATTCATGC TCATCGACGG CAAAGGCGAT
GATCGTGGCG GTAGTCTGGT TGATGTCGTG CGTCAACTGA TTCCGACGGT GGACGAAGGA
CGGTTCATCC TGCTCGATCC GCTCGACACG GCATGGCCCA TCGGACTGAA CCCGCTCGCG
GGCGCCGATG TCACGCAACC GGGCGGCGCC GATCTGGCGC TGGGGCAAAT CCTGGCAACC
TTCGCGCGCA TCGATCCTGA TACGTGGGCG CGATCGCCGG GCATGCAGCA GTTCGCGCGT
ATGGCAGCGC TGCTGGTGCT GGAGGGCGAA ACGCATCCCA CACTGGCGCA CGTCAAACAG
GCGCTGATCG ACGAAGCGTA CCGCCAGGAA CTGCTCCGGT CGGCGCGGAA CATCGAGGTC
GCCAGTTTCT GGCTTGAGAC ATACCCGCGC CTGGGTGAGG GGCAGCGATC CAGTTGCGAC
GCGCTGTTGA GGCGTTTCGA TGCGCTGTTG ACCGCTGAAA CGACGCGCTA TCTGGTGGCG
CAGGCACGCC CCACGCTCGA TCTGGCGCAG ATGATTGCCG ACCGGATGAT CGTGCTGGTT
CCGCTTCCCG ATGTGGCGCT TGGCGGGCTG GCAGGCGCGG TCGGCATGCT GATCGCCCAG
GCATTTGTGC GTGCCGCCTT CAGCCGTAGC GGCAGCGACC GCACCCGCCA CGATTATCCG
TTGATCATCG ATGAACTCCA GGTGCTGATC GGCAACAGCG ACACAACCGA TATAGCGACT
GCCATCACGC GCCTGCGGTC GCTCGGCATT CCGACGATCT ACGCCCATCA GGCGCTGGCG
CAGTTGGGTG ATCTCCGCGA CCTGATGCTG ATCAATGCCG GGAACCGCAT CATGCTGCAA
ACCCAGGAAC CCGATGCCAG CATCTATGCG CGCGCCTACG GCGCCAGCGG TCTCACCGCC
GCCGATCTGA GCGGGCAACC GCCGAACGAC CATCAGTACG CCGTCCTGCG CTGCCGGGGC
GTGGTGGCCG GACCTTTTTC GATGCAACCG CTCCCCTGGC CCGCTGTGCA CGACGAGTCG
CCGCCGCCAT ACCAGGGACC GGCGTGGCGC GACGTTCTGC CCGACGACGC CGATCCGGCG
GACCGGTTCA TTGCGCGCGT CGTCTACGAT GCCGGTACAG GCGCAACTGC CGCCAATGAA
CTGGCGCACC TCAGCGACGC CGACTGGCAG CGACTTCTGC GGCGGTGGGA GCGGATCAGG
CACGTTCAAC GGCAGCACAT CCTTGCGCAT CCCGGTTGTA TCGCCGACCG ACTGGAACGT
CAACGCTGGC TATCGCGTCT CTATGCGGCG CGTCCGCGGG TGCTGGCTGC CGCCGAATAC
CTGCGCGGGC GTCAGAAAAG ATCGCATCAG CACGCCTTAA ACTCGACGTG A
 
Protein sequence
MSRVALRPSP SNPPPATRPP RPDLPFHLVR ALALICIVLA LVQPLALPQV ADSPSFQAIL 
LPRVLHEPGV VAPALLRRAF EPPLFPMISA VAVASLCILA GELSVALIAA ILARHAARRA
TRSTICLRVR PAHTLAGTSA SVARPGALMR MIHGGMAKRF WMHPAPPYTL IISGAPDLPA
ELGALISGTP DDQQRAMGVL DGAVRSSAPG AQIDATDDPL IAAATPGRWI AWQRFGLALP
PAYPLHTPAL VVEHELAGVL LAAVRPHGSV VHAGLEIALR PQGGMTGWAL GRQWRARAMA
LKLTLEQRQD YALAPDIAAI EAKLGDAAFE ATIVATAVAG QRDDAITALR AIGDALGAFQ
QRTASRLQRL VPLARASVRQ ITVGQEMQIV SHLRTPPVSH PPTLLLPFRL WRGPDILSAG
EVGYLWNLSN PQSNGLMRCD PCRRIAAPPH AFCADDPERI IVGYAAHTDG HSAPVGPTLR
DLRQILHLTA GMGAGKSRLL ANLCQQLIPH GFMLIDGKGD DRGGSLVDVV RQLIPTVDEG
RFILLDPLDT AWPIGLNPLA GADVTQPGGA DLALGQILAT FARIDPDTWA RSPGMQQFAR
MAALLVLEGE THPTLAHVKQ ALIDEAYRQE LLRSARNIEV ASFWLETYPR LGEGQRSSCD
ALLRRFDALL TAETTRYLVA QARPTLDLAQ MIADRMIVLV PLPDVALGGL AGAVGMLIAQ
AFVRAAFSRS GSDRTRHDYP LIIDELQVLI GNSDTTDIAT AITRLRSLGI PTIYAHQALA
QLGDLRDLML INAGNRIMLQ TQEPDASIYA RAYGASGLTA ADLSGQPPND HQYAVLRCRG
VVAGPFSMQP LPWPAVHDES PPPYQGPAWR DVLPDDADPA DRFIARVVYD AGTGATAANE
LAHLSDADWQ RLLRRWERIR HVQRQHILAH PGCIADRLER QRWLSRLYAA RPRVLAAAEY
LRGRQKRSHQ HALNST