Gene Rsph17025_3719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3719 
Symbol 
ID5085585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009429 
Strand
Start bp620201 
End bp621472 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content68% 
IMG OID640485282 
ProductIS66 Orf2 family protein 
Protein accessionYP_001169891 
Protein GI146279733 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACA CCACGACCCA TCTGGGCCTG CCCTATCTCC TGGCGGCGCA GGCGCAGAAG 
CATGTCACCC ACAACGAGGC GCTGCGCCTG CTCGACGCCA TGGTGCAGCT TTCGGTCCTC
GACCGGACCC GCACCGCACC GCCCGGCAGC TCGGCCGACG GCAACCGGCA TCTGGTGGCC
TCGGGCGCCA CTGGCCTCTG GGCCGGGTGG GACCTGAACG TGGCCTTCTG GGTGGACGGA
GCATGGCTTC GCCTCGTCCC GCGCACCGGC TGGCTGGTCT GGGTCGCGGC CGAGGAACTG
TTCCTTGTCT GGAACGGCAG CGCCTGGGAG GTCGTGGGCG AGCCCCGCGA CGTCTCGGAC
GCCGTCTTCA GCCTCGTCAA CGATGCGGAC CCGACGAAGA AGGCGACCTT CTCTCTCGCG
GGGATCAACG CGGGCACGAC GCGCAGCTTC ACGCTGCCGA ATACATCCTC GGAACTGGCG
ATCCTGGCGG GAACGCAGAC CTTCTCCGGC AACAAGACCT TTTCCGGCAC GCTGACGGCC
TCGGGCACCG TCACCGTCTC CGCAGCCTCG GCGAGCATCG GCACGGCCAC GACGACCGCG
ACCTACGGAA TGGGCACAGG GGCAACGACG ACGGGCGCCA CCAAGACCGT GAACATCGGC
ACCGGCGGAG CATCCGGGTC AACGACTGTC GTCAACATCG GCTCGGCCAC GGCTGGCGCG
GGCGGGACGA CCGTCATCAA CACGCCGACC GTCACCTTCG CCAATGCGGT CACGCAGGTT
GGCATGCCCC AGGCAAATCT GACCGCACAA CTCTTGGGCC TCGGCGGCGC GACCGCCGAC
AGCTACAACC GGCTCTCGGT GAACACCCCG GCCGTGCTGC TGAACAACGC AGGTGCAAGC
ATCGAAGCCA CCGTGAACAA AGCGGCCCCG GCGAACGACG CCGCCTTCGC CTTCAAGACT
GGCTTCTCCG CTCGCGCGCT GATCGGCCTC CTCGGCAACG ATGACTTCAG CTTCAAGGTC
AGCCCGGATG GGTCGGCCTT CTTCGATGCC ATCAGGATCG ACCGCACGAA CGGCCAAGTG
GAACTGCCGC AGCCGACGGT CCTGCCCGGC CTCAACGCGG CGCCGACCCC GCCACCGGCA
GGCAAGGCTT CAGTCTATGC GCGAAGCCGC GCAGGCGCAC CGTGGATCGA CGTGATGCGT
CCCTCGGGCC GGGACTTCCC GCTCCAGCCG CACTTCGGGG GTAAGCGACC GCCCACCGCC
CTCGTTGCCT GA
 
Protein sequence
MSDTTTHLGL PYLLAAQAQK HVTHNEALRL LDAMVQLSVL DRTRTAPPGS SADGNRHLVA 
SGATGLWAGW DLNVAFWVDG AWLRLVPRTG WLVWVAAEEL FLVWNGSAWE VVGEPRDVSD
AVFSLVNDAD PTKKATFSLA GINAGTTRSF TLPNTSSELA ILAGTQTFSG NKTFSGTLTA
SGTVTVSAAS ASIGTATTTA TYGMGTGATT TGATKTVNIG TGGASGSTTV VNIGSATAGA
GGTTVINTPT VTFANAVTQV GMPQANLTAQ LLGLGGATAD SYNRLSVNTP AVLLNNAGAS
IEATVNKAAP ANDAAFAFKT GFSARALIGL LGNDDFSFKV SPDGSAFFDA IRIDRTNGQV
ELPQPTVLPG LNAAPTPPPA GKASVYARSR AGAPWIDVMR PSGRDFPLQP HFGGKRPPTA
LVA