Gene Rsph17025_4039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_4039 
Symbol 
ID5086212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009430 
Strand
Start bp74885 
End bp76231 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content67% 
IMG OID640485602 
Producthypothetical protein 
Protein accessionYP_001170196 
Protein GI146280039 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.415702 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCACC GTCCCCGTCC ACCCGAGCAG AACGACCTCT TCCGGCCGCG CCTGGTCGAC 
ATGATCGACA TGCGTCACGA GCTGGTCACG CTGTCGGCCC TGATCGACTG GGAGTTCTTC
GAACGCGAAT GGGCCGGGTT CTTCCCGTCG ACGACCGGAC GACCGGCGAT ATCGCCGCGA
CTCGTGGCGG GGCTGCTCTA TCTGCAGCAC GCGTTCCGGC TGTCGGACGA GGCTGTGGTC
GCGCGCTGGG TCGAGAACCC TTACTTTCAG CACTTCACCG GCGAGACCTT CTTTCAGCAC
AAGCCGCCGA TCGATCCCAC CTCGCTCATC CGGTGGCGGA AGCGGATCGG CGAAGAGGGT
GTCGAGTGGC TGCTGACCAA GACCATCGAG GCCGGGCGGG AATCGGGTGC TGTCGAGGAC
GCGAGCCTGG ACGAGATCGC CATCGACACG ACCGTGATGG AGAAGAACAT CTCGCACCCG
ACGGACTCCC GGCTCTACGA GCGGGCGCGC AGCCAACTGG TGGCCCTGGC CCGGGACGCC
GGCATCGAGT TGCGGCAGAG CTACGCGCGC CTTGCGCCGC GGCTGGCGGC GCAGGTCGGG
CGTTACGCCC ATGCCAAGCA GTTCCGGCGC ATGCGCAAGG CGTTGCGAAC GCTGAAGGGT
TACACCGGCC GCGTGATGCG GGACATCCGG CGGCAGCTCG ACGAAATCCC CGAGGGGCCG
CTGCGCGAGC GTGTGCTCGA CAAGCTCGTG CTGGTCTCGC GGCTGCTGCA CCAGCGGCCG
AAGGATCCCG GCAAGATCTA TTCACTGCAC GAGCCCGAGG TCGACTGCAT CTCGAAAGGC
AAGGCCCGCG TGCGATACGA GTTCGGCACC AAGGTCAGCA TCGCAACCAC GCTGAAGGGC
GGCTTCGTCG TCGGCATGCG GTCGCTGCCG GGGAACCCCT ACGACGGCCA CACCCTCGGC
GAGGCCCTGG AGCAGGTCGG CATCCTCACC GGCCACCCGC CCAAGCGCGC CGTCGTCGAT
CGCGGCTACA AGGGCCACGG CGTCGAGCAC ACCCAGGTCC TGATCAGCGG AACCCGCCGC
GGCCTCACAC CGGCGCTCGC GAAGGCGCTC CGCCGACGCA GCAGCATCGA GCCCGAGATC
GGCCACATGA AGGCCGACGG AAGGCTCGCG CGCTGCTTTC TGCAAGGCAC CTTCGGCGAT
GCGCTCTTCG CCGTCCTCTG CGGCTGCGGG CACAACATCC GCAAGATCCT CGCCCATCTG
AGGAAGCTTC TTGCCGCCGT CATTACCCTC GTTCTGGCGA TGATCCGGCA GGAGCACGCT
CGCGGCTACA GTCACGCGGC CGCCTGA
 
Protein sequence
MKHRPRPPEQ NDLFRPRLVD MIDMRHELVT LSALIDWEFF EREWAGFFPS TTGRPAISPR 
LVAGLLYLQH AFRLSDEAVV ARWVENPYFQ HFTGETFFQH KPPIDPTSLI RWRKRIGEEG
VEWLLTKTIE AGRESGAVED ASLDEIAIDT TVMEKNISHP TDSRLYERAR SQLVALARDA
GIELRQSYAR LAPRLAAQVG RYAHAKQFRR MRKALRTLKG YTGRVMRDIR RQLDEIPEGP
LRERVLDKLV LVSRLLHQRP KDPGKIYSLH EPEVDCISKG KARVRYEFGT KVSIATTLKG
GFVVGMRSLP GNPYDGHTLG EALEQVGILT GHPPKRAVVD RGYKGHGVEH TQVLISGTRR
GLTPALAKAL RRRSSIEPEI GHMKADGRLA RCFLQGTFGD ALFAVLCGCG HNIRKILAHL
RKLLAAVITL VLAMIRQEHA RGYSHAAA