Gene RPD_4022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4022 
Symbol 
ID4024539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4469508 
End bp4471118 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content63% 
IMG OID637964225 
Productextracellular solute-binding protein 
Protein accessionYP_571142 
Protein GI91978483 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00714098 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCACA CGTCGCATTG GCTGCGTTCA GTAGCTGTGT CGAAATTCGC AGTGCCGGCG 
CTGGCGCTCG CAGCTTCGCT GACGCTCCCT GCGTTTGCTG ATGCCAAGAC CATTCATGCG
GTGATGCATT CCGATCTGCG CGTGACCGAT CCCGGACTGA CCACCGCCTA CATCACCCGC
GATCATGGCT ACATGGTCTA TGATACGCTG CTCGCGATGG ACTCGAACTT CAAGGTCCAG
CCGCAGATGG CGGAGTGGAA GGTCTCCGAG GACAAACTGA CCTACACCTT CACGCTGCGC
GACGGCCTGA AGTGGCACGA TGGCGCTCCG GTCACCGCCG AGGATTGCGT CGCCTCGCTG
AAGCGCTGGG GCCAGAAAGA CGGCATGGGC CAGAAGCTGA TGGACTTCAC GGCGAGCCTC
GAGGCCACCG ATGCGAAGAC TATCACGCTG AAGTTGAAGG AGCCCTACGG GCTGGTGCTG
GAGTCGATCG GCAAGCCATC GTCGCTGGTG CCGTTCATGA TGCCGAAGCG GATCGCCGAG
ACGCCGCCGG ACAAGGCGAT CCCCGAGCAG ATCGGCTCCG GTCCGTTCAA ATTCGTCGCC
GCGGAATTTC AGCCGGGCGT CAAGGCGGTT TACGTCAAGA ACGCCGACTA CGTGCCGCGC
AAGGAGGCGC CGAGCTGGAC GTCGGGCGGC AAGGTGGTGA AGGTCGACCG GGTCGAATGG
ATCACCATGC CCGACGCGCA GACCGCGGTG AACGCGCTGC AATCCGGCGA CATCGATTTC
ATCGAGAATC CGTCCTTCGA CATCCTGCCC GTTCTGAAGC AGGACAAGGA ATTGACGATC
CACACGCTGA GCCCGCTCGG CTTTCAGACG CTCGGCCGGA TGAACTTCCT GTATCCGCCG
TTCGATAACG TCAAGGTTCG CCGCGCCGCG TTTCTGGCGA TGAGCCAGAA GCCGGTGCTC
GATGCGCTGG TCGGCAACCC GCAATACTAC AAGGTCTGCG GCGCCGTGTT CGGCTGCGGC
ACGCCGCTGG CGTCGGACGT CGGCTCCGAG ACGTTGGTCA AGGGCAGCGG CATGGCGGAG
GCCAAGAAGC TGCTCGCCGA GTCCGGCTAT GACGGCACGC CGATCGCGCT GATGGCGCCG
GGCGACGTCG TCACCCTGAA GGCGCAGCCG ATCGTGGCGG CGCAGCTGTT GCGCGAGGCC
GGTTTCAAGG TCGACGTCCA GGCCACCGAC TGGCAGACCG TGGTGACGCG GCGCGCCAGC
CAGAAACCGC CGAAGGACGG CGGCTGGAAC ATGTTCTTCA CCAATTGGGC GGGTCCGGAC
ATTCTCAATC CGGTCGCCAA TGTTTCGACC GGGGGCAAGG GCAAGAACGG CGGCTGGTTC
GGCTGGGCGG AGGACGCCAG GGTTGAAGAG CTGCGCGACA AATTCGCCCG CGCGACCTCG
CCCGACGAGC AGAAGAAGCT CGCCGAGGAG ATCCAGAAAG AAGTCTACGA CAAGGTGATC
TATATTCCGC TCGGCCAGTA CACAGCGCCC AGCGTATGGC GCAACGAACT GACCGGCGTG
CTCGACGGCC CGGCGACGCC GGTGTTCTGG AATATCGACA AGAAGGAATA G
 
Protein sequence
MFHTSHWLRS VAVSKFAVPA LALAASLTLP AFADAKTIHA VMHSDLRVTD PGLTTAYITR 
DHGYMVYDTL LAMDSNFKVQ PQMAEWKVSE DKLTYTFTLR DGLKWHDGAP VTAEDCVASL
KRWGQKDGMG QKLMDFTASL EATDAKTITL KLKEPYGLVL ESIGKPSSLV PFMMPKRIAE
TPPDKAIPEQ IGSGPFKFVA AEFQPGVKAV YVKNADYVPR KEAPSWTSGG KVVKVDRVEW
ITMPDAQTAV NALQSGDIDF IENPSFDILP VLKQDKELTI HTLSPLGFQT LGRMNFLYPP
FDNVKVRRAA FLAMSQKPVL DALVGNPQYY KVCGAVFGCG TPLASDVGSE TLVKGSGMAE
AKKLLAESGY DGTPIALMAP GDVVTLKAQP IVAAQLLREA GFKVDVQATD WQTVVTRRAS
QKPPKDGGWN MFFTNWAGPD ILNPVANVST GGKGKNGGWF GWAEDARVEE LRDKFARATS
PDEQKKLAEE IQKEVYDKVI YIPLGQYTAP SVWRNELTGV LDGPATPVFW NIDKKE