Gene RPD_1004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1004 
Symbol 
ID4021479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1136218 
End bp1138167 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content70% 
IMG OID637961195 
ProductAsmA 
Protein accessionYP_568143 
Protein GI91975484 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2982] Uncharacterized protein involved in outer membrane biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.478001 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.298922 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCCA CCAAGATCGC CGGCATCCTG ATTGCGGTGC TGATCGCCGC AGCGGCGCTG 
CTGCTCACCA TTGGAATTCC GTCGGGCGCC GTGACCTCGG CGATCCAGTC GCGCGTCGAG
CGCGACACCG GCTACCGGAT CGCGATCGCC GGCGCGAGCA AAGTGAGCCT GTTTCCGGGC
TTCGCCGTCA CGCTGAACGA CGTCACGGTG CAGAACCCGA ACGATCGCAA TCCCGACAGC
CGTTTCACGA TCGGCAACGT CCGCGCCGAA CTGGCGCTCG GCAGCGTGAT CGCGGGCAAG
CCGCACGTCA CCGAACTGAC GCTGACAAAG CCGGAGCTTC GCGTCCCGCT GATCCGCGAC
CGCAACAATG CGCCGGCCGC CTCCTCATCC GCCGGCAGCG GCAAGACGAT CGACGCCGCG
ATCGATCGCA TCACGATCAA GGACGGCGTC GTCGTGATGG CGAATGCGGC CGATCGCGTC
GAGGACCGGA TCGATCGCAT CAACGCCGAG ATCACCATCG ATCCGGAGCG CCGCATCCAT
GCGATCGGCG GTGCGCAGCT CGCGGGCAAG CCGGCCAAAT TCGAGATCAA GGCGGCGCCG
CCGGCGGCGG ATGCGCAGCC TGTGCCGGTG GATTTCAAGC TCGACGCGCC CGACCTGCTG
AGCTGGCCGC TCGCCGCCAA GGCGGAGGTT CGGCTGCGCG GCGCGCTGCT GCAGATCAAC
GGCCTGTCCG GCACGCTCGG CGACGGCGCC TTCAACGGCT GGGCTTCGGC CGATCTCGCC
GGCAAGCCGC AACTCAAGCT CGACCTCGAT TTCCAGCGGC TCGACCTCGG CAACCCGCGC
AAGCCCACCG CGCCCGGCGC GCCCTGGAGC GATGCGCGAT TCGATCTGTC CGGGCTGAAC
TATGTCGACG CCGAGATCCG GTTGTCGGCG GCCGAGCTCA ACCTCGGCGC GGCGCATTTC
GCTCCCGCGT CGATCGACGC CAAGCTCGTG AGCGGCGCGG TCACCGCGCA GTTTGCGCAG
ATTGGCGTCT ATGGCGGCGA GGCGGAGGGG CAGCTCGGCA TCGACGCGTC GCAGCGCACG
CCGACGTTCA GCCTGCGCGG CGATCTCAAC GGCGTCCGGG CGCTGCCGCT ACTCAGCGGC
TTGGCCGAAT TCGACAAGAT CGACGGCAAG ATGCAGGCGA AGCTGGCGCT ACGCGCCAGC
GGCGACAGCG CACGCGCCAT CCTGTCGACG ATCGCCGGCA CCGCCTTCGT CAACGTCCGC
GACGGCGAAA TCCGCGGGCT CAACGTCGCG CGGATGATCC GCAATCTGAC CACCACGACG
CTGTCCGGCT GGCAGGAAAA CGGCGCCGAG GTCACCGACC TGACCCAGCT CGGCGCCTCG
TTCCGGATCG AGCAGGGCAA GGCCGCGACA GCCGATCTCG CGCTCGCCGG CCCGCTGGTG
CGGATGACCG GCGCGGGCAC GATCGATCTC GGCGCCAAGA CGCTGTCACT CAAGGTCGAG
CCGAAGCTGG TGCTGACCAC GCAGGGCCAG AGCGCGACGG GACAGGCGCC ACAGAACGGC
GCGGCCGCCG AGCCTGTCGG CCTCGGCATT CCGGTGGTGA TCGAGGGGCC ATGGGCGTCG
CCACGGATCT TCCCCGACGC CGCCGGCATC CTCGACAATC CCGACGCCGC CTATGCGCGG
CTGCGGGAGA TGGGCAAGGG CTTGTTCGGC GCGCTCGGCG GCGGCGGCGC GGCCGGCAGC
CCGGGCGCGG ACAACCCGCT CGGCGGGGCG CTCGGCGAGA GCATCGGCCG GCTGATCCAG
CAGGGCCTCC AGAGCGGCGC GGCCGCACCC TCCCGCGGCG CGCAGCCGGC CCAGCCGCCC
GCGCAGCAAC CCGGCGCGGC GCCCGCGCCC GACCGGCCGG ATAGCGACGC GGCCATGAAC
GCGATCATGA AGCAATTATT CGGCCGCTAG
 
Protein sequence
MKPTKIAGIL IAVLIAAAAL LLTIGIPSGA VTSAIQSRVE RDTGYRIAIA GASKVSLFPG 
FAVTLNDVTV QNPNDRNPDS RFTIGNVRAE LALGSVIAGK PHVTELTLTK PELRVPLIRD
RNNAPAASSS AGSGKTIDAA IDRITIKDGV VVMANAADRV EDRIDRINAE ITIDPERRIH
AIGGAQLAGK PAKFEIKAAP PAADAQPVPV DFKLDAPDLL SWPLAAKAEV RLRGALLQIN
GLSGTLGDGA FNGWASADLA GKPQLKLDLD FQRLDLGNPR KPTAPGAPWS DARFDLSGLN
YVDAEIRLSA AELNLGAAHF APASIDAKLV SGAVTAQFAQ IGVYGGEAEG QLGIDASQRT
PTFSLRGDLN GVRALPLLSG LAEFDKIDGK MQAKLALRAS GDSARAILST IAGTAFVNVR
DGEIRGLNVA RMIRNLTTTT LSGWQENGAE VTDLTQLGAS FRIEQGKAAT ADLALAGPLV
RMTGAGTIDL GAKTLSLKVE PKLVLTTQGQ SATGQAPQNG AAAEPVGLGI PVVIEGPWAS
PRIFPDAAGI LDNPDAAYAR LREMGKGLFG ALGGGGAAGS PGADNPLGGA LGESIGRLIQ
QGLQSGAAAP SRGAQPAQPP AQQPGAAPAP DRPDSDAAMN AIMKQLFGR