Gene Rsph17029_3926 details


Gene Information

Locus tag: Rsph17029_3926
Symbol:
ID: 4898794
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 1061708
End bp: 1062808
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 74%
IMG OID: 640114529
Product: ApbE family lipoprotein
Protein accession: YP_001045776
Protein GI: 126464663
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis
TIGRFAM ID:
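
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; both are assumptions about this record's conventions, and the function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1061708, 1062808

length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints "1101 366"
```

Under these assumptions the computed values agree with the listed Gene Length (1101 bp) and Protein Length (366 aa).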


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.166642
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGACCG TGACCCTCCC GCGCCGGGCC TTCCTTGCTA CCCTCGCCGC CCTCGCACTC 
GGCGGGGTGG CGGCCCTGCG CCTCGAGCCC GCGTCGGACG TTCGCGAGAC CTTCTATCTC
TTCGGCACGC TGGTCGAGAT CGAGACCCAC GGCGTGCCCG AGCGCACGGC CCGCGCCGCC
ATCGCCCGCG CGGGCCAGAG CCTTGAGGCC GCGCATCGCG ACTGGCACGC TTGGCGGCCG
GGCGAACTCG GGACCTTGAA CGCGGCCATC GCGGCGGGGC GCCCACAGCG CGTGGATGCG
ACCCTCGCGG CGCTCCTGCG CAAGGGGCGC GCCCTCTCCT GTGCGAGCGG CGGGCTGTTC
GATCCGGCCA TCGGCAGGCT GGTCGGCGCC TGGGGCTTTC ACGCCGACAC ATTGCCCGAG
GGCGATCCGC CGTCTGCCGT CGCCATCGCG GCCCTCGTCG CGCGCCACCC GCGCATGACC
GACCTGCGCT TCGACGGCAC CGAAGTGCGC TCGCTCAATC CCGCGGTTCA GCTCGATCTC
GGGGCCTATG CCAAGGGCGC GGCGCTCGAG ATGGCGGCGG CCGAGCTTCG GGCGGCAGGC
GTCGAGGATG CGGTGCTGAA TGCGGGCGGC GGCGTGCAGG TCATCGGCCG GCACGGCGCG
CGGCCCTGGC GGGTGGCGAT CCGCGATCCG TTCCAGTGGG GCGTGGTGGC GGGCGTCTCG
CTCCGTCCCG GCGAGGCCCT CCATACGTCG GGCAATTACG AGCGCTATTT CGACCGCGGC
GGGGTGCGCT TCTCGCACAT CCTCGATCCG CGGACGGGCC GTCCGATGCA GGGCATCGTC
TCGGTCTCGG TGCTCGACAC CGACGGGGCG CGGGCCGATG CCGCGGCCAC CGCTCTCTGC
ATCGCAGGCC GCGAGGACTG GCCGCGGGTC GCGGCGGCGA TGGGCGTGCG CGCCGTCCTG
ATGATCGCGG ACGATGGCGC GGTCTTCGCC ACGCCCGAGA TGATGGCGCG GCTCGAACCG
GCGCAGGGCG GCTACCCCGC CCCGGTGCAG ATCGTATCGC TGCCGGAGGA TGTCACACCG
CCCTCCTGTC CCGAGGACTG A
 
Protein sequence
MRTVTLPRRA FLATLAALAL GGVAALRLEP ASDVRETFYL FGTLVEIETH GVPERTARAA 
IARAGQSLEA AHRDWHAWRP GELGTLNAAI AAGRPQRVDA TLAALLRKGR ALSCASGGLF
DPAIGRLVGA WGFHADTLPE GDPPSAVAIA ALVARHPRMT DLRFDGTEVR SLNPAVQLDL
GAYAKGAALE MAAAELRAAG VEDAVLNAGG GVQVIGRHGA RPWRVAIRDP FQWGVVAGVS
LRPGEALHTS GNYERYFDRG GVRFSHILDP RTGRPMQGIV SVSVLDTDGA RADAAATALC
IAGREDWPRV AAAMGVRAVL MIADDGAVFA TPEMMARLEP AQGGYPAPVQ IVSLPEDVTP
PSCPED