Gene Rsph17029_2029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2029 
Symbol 
ID4897657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2148890 
End bp2151025 
Gene Length2136 bp 
Protein Length711 aa 
Translation table11 
GC content50% 
IMG OID640112622 
Producthelicase domain-containing protein 
Protein accessionYP_001043904 
Protein GI126462790 
COG category[R] General function prediction only 
COG ID[COG1204] Superfamily II helicase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.434409 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTGT TAGGAACTCT CCGTGGCATC TCCGAAACGA CGCCTCTTTC GGCTGACGAT 
GTTTTTGAAG CTATACTCGA AGCAAGCAGA ATTTCGAACT CAGCGCGTTA CGAAGACCCG
GAAGCGCTGG AGATCGCGAT CCGTCTTCTT GATGCGAGCG AAAGAGGTTT GGTGCCCTTC
GGTTCTCACA CGGCCATTGA GCATCTCGCC GAAGAATGTG GGCTATTTCC GTATCTCGAC
ATAGACACCC TCACCCCATT TCGCGCAACA TTAGTCGAAG CTTTTTCTGT CAATTTGAAC
CAACATAAGT TCTACCTCCA CGCAAAGCAG GTGGAGGTTT TAGTTCACCT CTTAAAGGGC
GAGAACGTCG CACTAAGTGC GCCAACAAGT TTCGGTAAAA GTCTACTACT AGACGCCTTT
ATTGAAAGGG CCAATCCCTC AACCGTTGTC GTCATCGTTC CAACTATAGC TCTTATCGAT
GAGACTCGGA GGCGCCTTCA AGGAAACTTT GGGAGTCGAT ATCAGGTGAT ATCAAAAGCA
TCGGATGTGC GAAGAGAAGG GCGGGTGATC TATGTCCTTA CCCAAGAGCG CTTCTTAAAC
CGGGACGACA TCAAGATCAT TGATTTTCTA TTTGTTGATG AATTTTACAA GCTCGATCCC
GGTCGCGACG ACGCAAGGTT CAGGACACTT AACGTGGCTT TTTATCGCGG CCAACGCATT
GCTCGTCAGT TTTTTTTGGC GGGGCCTAAT GTTGCCAGCT TGAGCGTTGG CCCGGCATGG
AAAGACCGTA TTACATTTAT GAGATCCAAC TACCAGACGG TTACCGTGAA TGCGATTGAT
AGAACAAGAA GCCTTAAGAG GTTCTCTACC TTTCTTAGTG ATTTGCGGTC CGTCGGCGAG
GAACAATCGC TGATCTACTC AAGATCTCCC CCGGCATCTC GACGTCTTTT GGAAGAACTG
CTTGGAGGAG GTTATGCCCA GGAGTGTGAG ATAGGCACGG AACTAGGCTT GTGGATCGCT
GAGAATTACC ATCCTCAATG GGTTTTGGTG GATGCTGTTG GAAAAGGTAT TGCGTTGCAC
CATGGGAAGA TACCGCGGGC GCTTGCGCAG TTATTCGTTC AGCTGTTCAA CGAGGGGCGG
GTGCCAGTTA TGATCTGCAC CTCGACGCTG ATCGAGGGGG TGAACACGTC CGCCGAAAAT
GTTTTTGTTT ATGACAAAGA GATCGGCACT CGTCCATTTG ATTTCTTCTC GTTTTCTAAT
ATTCGGGGCC GCGTAGGCCG AATGATGCGA CATTTTGTTG GAAGGGTTTT TCTTTATCAC
GCTCCGCCTC AAGCAGATGA GCTAGTAGTC GAGATCCCGG CGCTTTCCGA TCCCAGTGAA
GCGGACGACT ATATCCTCAT GAACTACGAG AAGGACGATC TTGATGGCTC CGCGCTAAGA
AAGCAATCCG CCCTCCCTCT TGAATACAAC TTGAGCTTTG AAACGCTCAA AGAGTTTGGT
CATTTTGGAG GCGAAGCTCT ACACAACACC AAGGACACCG TTAGGAGGAT GATAGGCGAT
AAGCCTAAAC ATTTTAACTG GTCTGGGCAT CCTGATTATA ATCAGAGAGT AGCGCTGGCG
AAAGCTATTC GGCCACTACT TACAAGCAAG AAAGACAAAT CCACCAGACT GACCGCAAAA
CAAATGGCTT GGGCGTGGGA TATGTTATCT AAGCTCAAGT CGCTCCCGGA GTTCCTTCAC
TGGTTTCAAA TTACGTTCTC GGCGGACGAA GTGCAGGAAG GTATCGAGAG AGCTTTCGAA
TTTCTGGGGT CATGTGAGTT CAACCACGCA ACGGCAACAG CGGCGGTGAA TCGACTCGTC
CTTGAGCTTC GGCCGGATAT GAATGCCGAC TATTCTCTAT ATTCGTTTCA GTTGGAATCG
TGGTTTCGCC CGCCTTGGCT GAAGGAGCTG GATGAGGTGG GTATCCCACT CCCACTGTCG
GAGCGGTTGC TTCGCTTTGT CGGGCGCCAG GATGATTATA AAGCTGTGCT TCGGCTTCTA
AAGGGGCTTC CGGCGCAGGT GATGGATACA CTTTCGCCGA TAGATATAAT GTTGATTTCC
AGAGCGACTG AAGCTGAAAG AACGCAGCTC TTTTAG
 
Protein sequence
MSLLGTLRGI SETTPLSADD VFEAILEASR ISNSARYEDP EALEIAIRLL DASERGLVPF 
GSHTAIEHLA EECGLFPYLD IDTLTPFRAT LVEAFSVNLN QHKFYLHAKQ VEVLVHLLKG
ENVALSAPTS FGKSLLLDAF IERANPSTVV VIVPTIALID ETRRRLQGNF GSRYQVISKA
SDVRREGRVI YVLTQERFLN RDDIKIIDFL FVDEFYKLDP GRDDARFRTL NVAFYRGQRI
ARQFFLAGPN VASLSVGPAW KDRITFMRSN YQTVTVNAID RTRSLKRFST FLSDLRSVGE
EQSLIYSRSP PASRRLLEEL LGGGYAQECE IGTELGLWIA ENYHPQWVLV DAVGKGIALH
HGKIPRALAQ LFVQLFNEGR VPVMICTSTL IEGVNTSAEN VFVYDKEIGT RPFDFFSFSN
IRGRVGRMMR HFVGRVFLYH APPQADELVV EIPALSDPSE ADDYILMNYE KDDLDGSALR
KQSALPLEYN LSFETLKEFG HFGGEALHNT KDTVRRMIGD KPKHFNWSGH PDYNQRVALA
KAIRPLLTSK KDKSTRLTAK QMAWAWDMLS KLKSLPEFLH WFQITFSADE VQEGIERAFE
FLGSCEFNHA TATAAVNRLV LELRPDMNAD YSLYSFQLES WFRPPWLKEL DEVGIPLPLS
ERLLRFVGRQ DDYKAVLRLL KGLPAQVMDT LSPIDIMLIS RATEAERTQL F