Gene Rsph17029_3870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3870 
Symbol 
ID4898524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp997542 
End bp999158 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content72% 
IMG OID640114474 
Producthypothetical protein 
Protein accessionYP_001045721 
Protein GI126464608 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0563781 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAGGG CATGGGGTTT TGCGCTGGCG ATCCTCTTGG CCGCGCCGCC CGCCGCGGCG 
GGCGGGATCA CGGTGGACGA GACATATTGG AACCCGCAGC CGGCCGAGGG CGACCTCGTG
CTGCCGATGC CCTGCGGCGG CGCGATGGCG TTCCGCCCGG TGGCGACACC CAATGCCGAC
GGTGCGGTGG GCGACGTTCC GGTCATCCTC GGTCGCGAGG ACGAGGACCA GCCCTATCTG
GACGGCACGC GGCGCTCCTA TGTCTCGGGA GGGTTTCCGG GGACGGGCGA GGCGGAAGCC
AAAGCCATGT TCTACATGGC GAAATACGAG ATCGCCGAGG CGCAGTACCG CGCTGTGACG
GAAGGCTGCC CGCAGAAGGA GCCGCGCCGG CGCGACTTCC TGCCGGTGAC GGGTGTGACG
AAGACCGAGC TTGACGCCTT CGCGCAGGCC TGGACGGTCT GGCTGATGCG GAACGCGCCG
GAAAGCCTCG CCCTCGCCGG TGCGGCGCCG GCCCATCTGC GGCTTCCGAC CGAGGAGGAG
TGGGAGTTCG CCGCCCGGGG CGGTCTGGCC GTCGATCCGG CGCTGTTCCG CGGCGCGCTG
CCGCCGATCC CGCCCGGTCA CTCCGCGTCC GAATACATCG CGCATGGCGG CAACGACAGC
GCGGGCGGCA AGGTGCAGGC GATAGGCACG CTGGCGCCGA ACCCGCTCGG GCTGCACGAC
ATGCTGGGCA ACGTCTCGGA ATATGTCCTG ACCCCCTTCG CGATGGTGCG CCACGGGCGG
CTGCACGGGC AGGCCGGCGG CTACGTCAAG CGCGGCGGCG ATGCACGCAC GCCGCTCGAC
CAGATCACCA GCGCGACCCG GTTCGAGGTG CCGCCCTTCG ACGCGCATTC GAAGGATGTG
ACGCGCGAGG CTTTCACCGG GGGGCGGCTC GTGCTGTCCA CGCTCTCGAT CACCTCGGCC
GAGCAGGCGA AGGCGGTCGC GGCGGCGCTC GAGACCCTGT CGCGGGCCGA CCCTGCGCTC
GACTCGGCGG CCTCGGAGGC CGAGGTTCTG GCGCTGCTCG ACCGGCTTCA GCGCGAGGCC
GGCGATGCCG CCGACCGCAG CCGCTTTGCC ACCATCGCCC GGACGATCCG CGAGGCCCGC
GCCGAGACGA ACGCGCAGCG CGACCGGACC ATCCGGATGA TCCTCGGCTC GTCGGTGCTG
ACCTGCGACC AGATCGTGCA GCGCTACCTG AACGCGCTGG CCATCGCGGC GCTGGTGCCG
AGCTATGACG GGCTGGCCGC CGAGGCCGAG GCCAGCGGCG ACACCGCACT GGCGCAGGAG
GTGGCCGAGG CCCGCGCCGA AGCAGAGGCG AAGCTGCGCG AGATGGAGGA GGCGGTCGGC
CGCGAAAGCG TCGACTACGC CAACATGATC GAGGGGCTTT CGGCCGAATT CTCGCAGGAG
CTGCTGGCGG CGCAGATCGC GGCCGTGCGG GGCGAGCACG AGAGCCGCGG CCCTCGGCGC
GGTGCCTGCC TCGGCGCGGT GCAGGCCCAT CTCGACCGGC GGGCCCGGAG CGGGATGAAC
GACCTCGCCG CCATCCGGTC CGACATGCAA CAGATCGCGG CCGCGCAGGC CCGCTAA
 
Protein sequence
MARAWGFALA ILLAAPPAAA GGITVDETYW NPQPAEGDLV LPMPCGGAMA FRPVATPNAD 
GAVGDVPVIL GREDEDQPYL DGTRRSYVSG GFPGTGEAEA KAMFYMAKYE IAEAQYRAVT
EGCPQKEPRR RDFLPVTGVT KTELDAFAQA WTVWLMRNAP ESLALAGAAP AHLRLPTEEE
WEFAARGGLA VDPALFRGAL PPIPPGHSAS EYIAHGGNDS AGGKVQAIGT LAPNPLGLHD
MLGNVSEYVL TPFAMVRHGR LHGQAGGYVK RGGDARTPLD QITSATRFEV PPFDAHSKDV
TREAFTGGRL VLSTLSITSA EQAKAVAAAL ETLSRADPAL DSAASEAEVL ALLDRLQREA
GDAADRSRFA TIARTIREAR AETNAQRDRT IRMILGSSVL TCDQIVQRYL NALAIAALVP
SYDGLAAEAE ASGDTALAQE VAEARAEAEA KLREMEEAVG RESVDYANMI EGLSAEFSQE
LLAAQIAAVR GEHESRGPRR GACLGAVQAH LDRRARSGMN DLAAIRSDMQ QIAAAQAR