Gene Rsph17029_4155 details


Gene Information

Locus tag: Rsph17029_4155
Symbol:
ID: 4895037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009040
Strand:
Start bp: 92746
End bp: 94242
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 69%
IMG OID: 640110546
Product: MlrC domain-containing protein
Protein accession: YP_001041858
Protein GI: 126464882
COG category: [S] Function unknown
COG ID: [COG5476] Uncharacterized conserved protein
TIGRFAM ID:
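
The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch like the one below. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length(92746, 94242)
print(length, protein_length(length))
```

With the values from this record, 94242 - 92746 + 1 gives 1497 bp, and 1497 / 3 - 1 gives 498 aa, which agrees with the reported Gene Length and Protein Length.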


Plasmid Coverage information

Num covering plasmid clones: 90
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 94
Fosmid unclonability p-value: 0.870407
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCCA TCCTGATCGT CGAATGCATG CAGGAGATTT CCTCCTTCAA TCCGGTGCCG 
TCCGACTATG AGCTGTTCCA TATCGAGGAG GGGGAGGATC TCTTCCAGCA TCGCGGCCGC
AACACCGCCA TCGGCGGCGC GCTCTCGGTG CTCGAGGCCA CGCCCGGCGT CACGCTGATC
CCGACCGTTG CCGCCCGCGC CGACAGTGCC GGCATCCTGT CGGCCGAGGG CTGGGCGCGG
CTGTCGGCGC AGATCCTCGA CCGGGTGCGG GCGGGGGTGG GCGGCGCGGA CGCCATCTAT
GTCAGCTTGC ACGGGGCCAT GGGGGCGGTG GGTGAGCCCG ATCCGGAGGG CTGGCTGCTC
GCGCAGATCC GCGCGCTTGC CGGACCGGAT CGGCCGATCA TCATCTCGCT CGATCTGCAC
GGCATTCTGA CCGAGCGGAT GATTGCGCAG GTGGATGGCA TCGCCGTCTA TCACACCTAT
CCGCATGTCG ATTTCGCCGA TACCGGCGCG CGTGCGGCGC GGCTTCTGCT GGACCGGCTG
ACAGCCCGCC GCCCCGCCAG CATCGTCCGG GCGGTGATCC CGGCGCTGGT GCGCGGCGAC
GAACTGATCA CCGCGACCGG CTGCTACGGC GACCTGATCC GCGAATGCCG GCGTCTCGAA
GCCGATGGCG TGGTGCTGGC GGCCGGCCTG ATGATCGGCA ATCCCTTCAC CGATGCGCCG
GAACTCTGCT CTCAGGTCGT GCTCACCACG GAAGACCCTC TGCGGGGCCG GGCCGAGGCC
GAGCGGCTGG CCGCGGCATT CTGGGCCCTG CGCCATCGCA TGCAGGGCCG TCTGATCGGT
CTTTCCGACG CCATTGCCGA AGCGCGGACA CTGACCGGCC CGGTCGTGTT CACCGATGCG
GCCGACGCGA CCTCCTCGGG GGCGAGCGGC GATTCGAATG CCATCCTGCG CGGGCTGCTC
GCTGCGGACT ATCCGGGCCG GGTGCTGGCG CAGATCGTCG ATCCGCAGGC CGCCGAGGCC
GCCCACAGGG CCGGGGTGGG GGCCGAGATC CCGGTCACGC TCGGCGGGCG GATGGATCCT
GCGCGTTTCA CGCCGCTGGA GGTCACTGCG CGGGTGCGGC TCCTGTCGGA CGGCCTGACC
CGGCTCGAGA CCATGAAGAC ACCGCTGAAC GCCGGGCGCT GCGCCGTGCT CGAATTCGGC
ACCGTCACCG TCGTCGCGAT GAGCCTGCCC GCCATGCTCT TCGACCGGGC GATCTACTAC
GCCAACGGGC TGGATCCGGC CGATTTCGAC CTGATCGTGG TCAAGTCGCC CCATACCGAA
CACCACATGT ATGACGCCTG GGTCACCCGC AACTTCAACA TCGACGCGCC GGGATCGACC
TCGGCGGATC TGGCCAGCCT CGGGCACCGG ATCTGTGCCC GGCCGATGTT CCCGCTCGAT
CCCCTGACCG ATTTCACCCC GCGCGCGACG GCCCATCTGC GCGCAGACCG GAGCTGA
 
Protein sequence
MPAILIVECM QEISSFNPVP SDYELFHIEE GEDLFQHRGR NTAIGGALSV LEATPGVTLI 
PTVAARADSA GILSAEGWAR LSAQILDRVR AGVGGADAIY VSLHGAMGAV GEPDPEGWLL
AQIRALAGPD RPIIISLDLH GILTERMIAQ VDGIAVYHTY PHVDFADTGA RAARLLLDRL
TARRPASIVR AVIPALVRGD ELITATGCYG DLIRECRRLE ADGVVLAAGL MIGNPFTDAP
ELCSQVVLTT EDPLRGRAEA ERLAAAFWAL RHRMQGRLIG LSDAIAEART LTGPVVFTDA
ADATSSGASG DSNAILRGLL AADYPGRVLA QIVDPQAAEA AHRAGVGAEI PVTLGGRMDP
ARFTPLEVTA RVRLLSDGLT RLETMKTPLN AGRCAVLEFG TVTVVAMSLP AMLFDRAIYY
ANGLDPADFD LIVVKSPHTE HHMYDAWVTR NFNIDAPGST SADLASLGHR ICARPMFPLD
PLTDFTPRAT AHLRADRS
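
The gene and protein sequences above can be cross-checked against each other and against the reported GC content with a short script. The following is a rough sketch, not the method used to generate this record. It assumes the sequences are pasted as plain strings with the 10-character spacing shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as initiators, which this sketch ignores). All names are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional TCAG ordering.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, truncated to 30 bases.
print(translate("ATGCCCGCCA TCCTGATCGT CGAATGCATG"))
```

Passing the full gene sequence to translate() and comparing the result with the cleaned protein sequence is one way to confirm that the two listings agree; gc_content() on the same input can be compared with the GC content field.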