Gene Rsph17029_3845 details

Gene Information

Locus tag: Rsph17029_3845
Symbol:
ID: 4898574
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 975363
End bp: 976583
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 67%
IMG OID: 640114449
Product: hypothetical protein
Protein accession: YP_001045697
Protein GI: 126464584
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
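
The coordinate and length fields above are related by simple arithmetic. A minimal sketch follows, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS, assuming one uncounted stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(975363, 976583)
print(length, protein_length_aa(length))  # prints: 1221 406
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of this record.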


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.521345
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCACCCGA TCGGCCCAGC GGCCGCGCTG CCGGTGCACG GCGATGGGGG GATGATGCAG 
AACGAGCGGA TCGGCGAGGC CGGAACCCTT CTGGAGCCGG AAGCGCCCGA CGTGCCCTCC
GAGAACGAGA TCAGCCGTCT CAACCACCTG ATCGCGATCC GCAGGCTGCT GGGGATCATC
CTTCTGGTCC TCGTGGTCAT GGGCTTCTAT TTCGCCCGCG ACGTGGTCCT GCCGCTGATG
ATCGGCCTTC TGCTGGCGCT GACCTTCAGC CCGGTCGTGC GGGCCCTGCA GCGGATCGGC
ATCGCACCGC CCATCACCGC GACCGCCCTC ATCACCGCCC TCGCCGCCGT CATCGCGGTC
AGCGCCTTCC TTCTGAGCGG CCCTGTCTCG GACTGGATCA ATCAGGCGCC GCGGCTGGGC
GATCAGCTGC GCGAGCGGGC CCAGACCATC CTCGACTCGT TCGAGGCGGT GCGGAACGCA
TCGGAGCAGG TCTCGGAAAT CACCGACAGC GAGGATCCGA CGGTGCAGCG CGTCGCCGTG
CAGACGCCGG GGATCCTGTC GTCCGCAGTC GGCAGCGTGG CCTCGATCCT CACCACGATC
ATCGTGACGC TGGTGCTGGC GCTCTTTCTG CTCGCCTCGG GTGACCTGTT CTACATCAAG
CTGATCGAGG GCTTCCCCCG CTTCGGCGAC AAGAAGCGCG CCCTGCGCAT CGTCTACGGC
ATCGAGCGGC GCGTCTCGCG CTACCTCCTG TCGGTGACCA TCATCAATGC GGGGCTGGGG
GTGGTGATCG GCCTCCTGAT GTGGGGCACG GGAATGCCGA GCCCGCTCGT CTGGGCCATG
GCGGCCTTCC TTCTGAACTT CCTGCCCTAT ATCGGCGCCA TTGCCGGGGT TGCGCTGTCG
GCGGCCGTCG CCATCGTGCA TTACGATCAC CTGACGCAGG CCCTGCTGGT GCCCGCGCTC
TACCTGACGG CCACCGCCAT CGAGGGGCAG CTCGTCACCC CCATCGTCCT CGGCCGCAGG
CTCGAGCTGA ACACGGTCTC GGTCTTCGTC ACGGTGATCT TCTGGGGATG GCTCTGGGGC
ATTCCGGGGG CGCTCGTGGC GGTGCCCTTC CTCGTCTGCA TCAAGGTGGT CTGCGACAAT
GTCGAATCCC TGCATGCGGT CGGCAATTTT CTGGGCGCTC GCGCGCCGTT GCCCGATCTC
GAGCAGGATA CGCCGGAGTA A
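
A GC content figure like the one in the Gene Information section can be computed directly from a sequence in this layout. The sketch below assumes that GC content means the fraction of G and C bases over all bases and that the spaces and line breaks are only display formatting. `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases, ignoring whitespace used for display grouping."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence above.
first_blocks = "GTGCACCCGA TCGGCCCAGC"
print(f"{gc_fraction(first_blocks):.0%}")  # prints: 75%
```

Passing the full gene sequence instead of the first two blocks gives the value for the whole gene.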
 
Protein sequence
MHPIGPAAAL PVHGDGGMMQ NERIGEAGTL LEPEAPDVPS ENEISRLNHL IAIRRLLGII 
LLVLVVMGFY FARDVVLPLM IGLLLALTFS PVVRALQRIG IAPPITATAL ITALAAVIAV
SAFLLSGPVS DWINQAPRLG DQLRERAQTI LDSFEAVRNA SEQVSEITDS EDPTVQRVAV
QTPGILSSAV GSVASILTTI IVTLVLALFL LASGDLFYIK LIEGFPRFGD KKRALRIVYG
IERRVSRYLL SVTIINAGLG VVIGLLMWGT GMPSPLVWAM AAFLLNFLPY IGAIAGVALS
AAVAIVHYDH LTQALLVPAL YLTATAIEGQ LVTPIVLGRR LELNTVSVFV TVIFWGWLWG
IPGALVAVPF LVCIKVVCDN VESLHAVGNF LGARAPLPDL EQDTPE
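
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is one way to reproduce that, not the database's own procedure. It assumes that the sense-codon assignments of table 11 match the standard code, that an initiation codon such as GTG is read as methionine, and that only the three most common bacterial start codons need handling. `translate_cds` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only these common bacterial start codons are treated as initiators.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading an initiator codon as M and stopping at a stop codon."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGCACCCGA TCGGCCCAGC GGCCGCGCTG"))  # prints: MHPIGPAAAL
```

The printed residues match the first block of the protein sequence, including the leading M that comes from the GTG start codon.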