Gene Rsph17029_2030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2030 
Symbol 
ID4897744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2151144 
End bp2152241 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content54% 
IMG OID640112623 
Productphage integrase family protein 
Protein accessionYP_001043905 
Protein GI126462791 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.755048 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACCA TCGTAGAACG CGCCCGCAAG GATGGCACCA AGTCATACCT CGCACAGATC 
ATACGCCGCA AGCACGGCTT CGCAGAGTCC CGAACCTTCC CCACACGCAA GACCGCCGAG
GCATGGGCCA AGATGCGCGA GCGCGAGCTT GATGCCCAGA TCGGGGCAGG GGGCGTCCCT
ACCACCCGTG CCGAAGTCAC CACCACCCTA GGCGACCTGA TCGACAGGTT CCTTGCGGAC
TCTGCGAAGC CTATGGGGAA GACACAACGC AACTGCCTTA AGGTCATTCG GACTGAATAT
GAAGTCGCCA ACAAGCGTCT TGATCAACTG ACGTCGAAAG ACCTTGTCGA AATGACGAAG
GAGATCGGGA ACCGACCCAC AGTCCGGAGC AAGTCAACGC CACTCAATTA TCTTGCCCAT
CTTAGCAAGT TGTTCGCCGT TGCGAGGCCC GCCTATGGGT ACCCGCTTGA TAAGTCGGTT
CATGACGACG CACTGAAAGC GTGCAAGGCG CTAGGATACA CCGGTCAATC GGGGAAAAGG
GATCGTCGTC CGACTGTTGG TGAGATAAAT CGTCTCATGG TCCACTTCGA TACGATGCAA
GGCAATACCA TTCAGATGGC AAAACTCGTT CCTTTCGCGA TCTTTTCGGC AAGGCGACTG
GATGAGATAT GCCGTATAAC ATGGACGGAT TATGAGCCGG AACATAAGCG GGTCATGGTC
CGCGACATGA AGCACCCCGG AAACAAACAG GGAAATGATC AATTTGTAGA TTTGCCCGAT
CCTTGTTGTG CGATAATAGA CTCGATGGAC AAGGTTGACG CGAGGATATT CCCCTTCAAC
TCCGCCAGTG TGAGCACGGC TTGGGCAAAA GCCTGCAAGA TGCTGGAAAT CGAAAATCTC
AAATTCCATG ATTTGCGGCA TGAGGGAGCA AGTCGCCTTT TCGAGATGGG CTGGACAATA
CCGCAGGCGG CATCTGTTAC CGGACATAGG GCATGGGCAA CTCTACAACG CTATTCGCAC
TTGAGACAAA CCGGCGACAA GTGGAGGGAT TGGGAATGGA TTCCTAAAGT GACGATGAAA
CATGCAGCGA CTGGATAA
 
Protein sequence
MATIVERARK DGTKSYLAQI IRRKHGFAES RTFPTRKTAE AWAKMREREL DAQIGAGGVP 
TTRAEVTTTL GDLIDRFLAD SAKPMGKTQR NCLKVIRTEY EVANKRLDQL TSKDLVEMTK
EIGNRPTVRS KSTPLNYLAH LSKLFAVARP AYGYPLDKSV HDDALKACKA LGYTGQSGKR
DRRPTVGEIN RLMVHFDTMQ GNTIQMAKLV PFAIFSARRL DEICRITWTD YEPEHKRVMV
RDMKHPGNKQ GNDQFVDLPD PCCAIIDSMD KVDARIFPFN SASVSTAWAK ACKMLEIENL
KFHDLRHEGA SRLFEMGWTI PQAASVTGHR AWATLQRYSH LRQTGDKWRD WEWIPKVTMK
HAATG