Gene RSP_4120 details


Gene Information

Locus tag: RSP_4120
Symbol: yapH
ID: 3711836
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007489
Strand:
Start bp: 85057
End bp: 86430
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 66%
IMG OID: 640069469
Product: hypothetical protein
Protein accession: YP_345336
Protein GI: 77404763
COG category:
COG ID:
TIGRFAM ID:
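The lengths listed above follow from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both are assumptions about the record's conventions, though they agree with the values shown. The function names are illustrative only.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 85057
END_BP = 86430


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1374 457
```

Both results match the Gene Length and Protein Length fields of the record.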


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGACG CTACTGGGCA GCCTCATGTG GCGGCAGGGG CAATGCAGAG GTTTCGCCCT 
GGGGCCAAAG GGTCGAAGTA TGGCCTTTGG GGCGGCGTTG CCTTGGTTGC CCTGACGACT
GGCGCGGCGG GAGCCGAAAT GGCCATCGTC GGAGACGTGA CCATCACGGG GCACATAAGC
GCGGGAAGCA TCTCCGCGGC AACCCTCGAA GGCGCCCTCA ACGTCACCGG TCCCAGCGTA
CTGAGCGACG CCAGCGCGAC CTCCCTCTCC GTCAGCGGCA CCTCGCAGCT CAACGCCCTA
GCTGTTTCCG GCGCCAGCAC TCTGCAGAGC GCAACCGTTC AGGGCAACGC TGCAATCGCG
GGCGCACTGA ACGTGGCCGG ACAAAGCACC CTCGGCGACG CGCGGATGCA GACCGCCACC
GTTCAGAAAG CCCTTGCCGT CAACGGACCG ATGGGCGTGA GCGGCACGGC GAGCTTCGGG
TCGGATCTGG AGGTAGGAGG CGCCGGACGC TTCGGAAGCG CCAGCGTCGC GGGAACCACT
TCGACCGGAG CCCTTTCCGT CGCCGGCACC TCACAACTCG ACACCCTCGT CGTGTCGGGA
GCCAGTACGA TGGCAAAGGT CGACGTGCTG GGCCCGCTGG CCGTAACCGG GGCCGCCGGC
TTCGGCGATC TCGTTGCCAA GGACATGAGG ACGGAAGACC TGCACGTCAC CGGCAACCTG
ACCATCGACG GAAACCTGTC GCTTCCCTCG AAGTTCTCCT TCGGAGAGCT GGAAACTACC
GGAAGCAGCC GGCTGGCTGA TCTGCAGACG ACCGGTCAGG TTGCAATGAA CAACGCCGGC
TCCAGCTTCA CCTTGGGCTC GTCCGGCATT CTGGCCACGA CCGCAGGGGG AGCCCGGGTG
CAACTGACGG ACACGGCCGC AGTCCTTACC CATGGCGGCA ATGGCATCAC GGCCACGGCC
AATGGAACCA CCCGGATCAC TGCCATACAC GAGGCAACAC TGCAGGGAGG TAACACCACC
CTTGCCCTGA CCGACACCGG CGCCCGCCTT TCCGGATCCG GCAGCGCACC CGCCCGCCTG
TCGGGGATCG CCGACGGCGT GGAAGACAAT GACGCCGTCA ACGTCGGGCA GCTGAATGAC
GGGCTTCGGG AGGTGAGTGC GGGCGTCGCG ATGAGCATGG CGATGGCACA GCTTCCAGCT
CCCCTCGACG GCAGCAATCA CTCCTTCGGC GTGGCCGTCG GTGGGTTCGA TGGCCAAGAG
GCGCTGGCCT TGGGGGGAAC TGCCATCGTG AACAACAATG TGACGTTACG TGGCGCGCTC
AGCCATGCCG GCGGCAAGAC GGGTGCCGGT GTCGGCGTCG GCTGGAGCTT CTGA
 
Protein sequence
MKDATGQPHV AAGAMQRFRP GAKGSKYGLW GGVALVALTT GAAGAEMAIV GDVTITGHIS 
AGSISAATLE GALNVTGPSV LSDASATSLS VSGTSQLNAL AVSGASTLQS ATVQGNAAIA
GALNVAGQST LGDARMQTAT VQKALAVNGP MGVSGTASFG SDLEVGGAGR FGSASVAGTT
STGALSVAGT SQLDTLVVSG ASTMAKVDVL GPLAVTGAAG FGDLVAKDMR TEDLHVTGNL
TIDGNLSLPS KFSFGELETT GSSRLADLQT TGQVAMNNAG SSFTLGSSGI LATTAGGARV
QLTDTAAVLT HGGNGITATA NGTTRITAIH EATLQGGNTT LALTDTGARL SGSGSAPARL
SGIADGVEDN DAVNVGQLND GLREVSAGVA MSMAMAQLPA PLDGSNHSFG VAVGGFDGQE
ALALGGTAIV NNNVTLRGAL SHAGGKTGAG VGVGWSF
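The sequences above are printed in blocks of 10 residues, 60 per line, so the whitespace has to be removed before they can be used. A minimal sketch of that cleanup follows, together with a simple GC-percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and this is not necessarily how the record's GC content field was derived.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks of the block layout; uppercase the rest."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Last line of the gene sequence above, used here only as a short sample.
sample = "AGCCATGCCG GCGGCAAGAC GGGTGCCGGT GTCGGCGTCG GCTGGAGCTT CTGA"
print(len(clean_sequence(sample)))  # 54
```

Passing the full gene sequence through `clean_sequence` should give a string whose length equals the Gene Length field.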