Gene RPD_1473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1473 
Symbol 
ID4021952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1641660 
End bp1642880 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content63% 
IMG OID637961667 
Productputative urea/short-chain binding protein of ABC transporter 
Protein accessionYP_568611 
Protein GI91975952 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.267716 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGCCA AGCCTCTCGC GGCGGCGATG ATGACGGCTG CGCTTATGTC ATCCTCGACC 
GCATTCGCCC AGGTATCCGA CGACATTGTC AAGATCGGTG TACTGACCGA TATGAACGGT
CCCGCGTCGA CGCCGACCGG CCAGGGTTCG ATGACGGCCG CGCAAATGGC GATCGACGAT
TTCGGCGGCC AGGTGCTGGG CAAGCCGATC AGCGTCATCG TCGGCGACCA CCAGCTCAAG
CCCGACATCG GCGGCGCTCT GGCGCGGCGC TGGTACGACG TCGAACAGGT CGACCTGATC
GTCGACGTGC CGGTCTCCGC GGTCGGTCTC GCGGTTCAGA ACATCGCCAA CGAAAAGAAG
CGGATGTTCA TCACGCAATC GACCGGCGCC GCCGATTTTC ACGGCAAGTT CTGCAGCCCC
TACACGATGC AATGGGTGTT CGACACCCGG GCGCTGGCGG TCGGCACCGC GCAGGAGGTC
GTGAAACGCG GCGGCGACAC CTGGTTCTTC ATCACCGACG ACTACGCCTT CGGCCTGTCG
CTGGAGCGCG ACGCCGCGGC GGTGGTGACC AAGGCCGGCG GCAAGGTGAT CGGCTCGGTG
CGTCCGCCGT TCGCGACGCC GGACCTGTCG TCCTTCGTAC TTCAGGCGCA AGCCTCGAAG
GCCAAGATCA TCGGCATCGC CGGCGGCCCG CCGAACAACA TCAATGAAAT AAAGACCGGC
GCCGAGTTCG GCGTCTTCAA GGGCGGACAA CAGATGGCGG CGCTGCTGGC GTTGATCACC
GACATCCATT CGCTCGGCCT GCCCGCCGCG CAGGGCCTGT TACTGACGAC GTCGTTCTAT
TGGGACATGG ACGACAGGAC CCGCGAATGG TCGAAGCGCT ACTTCGCCAA GATGAACCGG
ATGCCGACGA TGTGGCAGGC CGGCGTGTAT TCCGCGGTGA CACACTATCT GCAAGGCATC
AAGGAGGCCG GCACCGACGA GCCGCTCAAG GTCGCCGCCA AGATGCGCGA GAAGCCGATC
GAGGATTTCT TCTCGCGCAA TGGCAAACTG CGCGAGGACG GTCTGATGGT GCATGACTTG
ATGCTGGTTC AGGTCAAGAG CCCGGAGGAG TCGAAATATC CGTGGGACTA TTACAAGATC
CTCGCGCATA TCTCCGGTGA AGAAGCGTTC GGCCCGCCCG ACCCGGCCTG CCCGTTGATC
AAGAAACAGG CGGCGAATTG A
 
Protein sequence
MFAKPLAAAM MTAALMSSST AFAQVSDDIV KIGVLTDMNG PASTPTGQGS MTAAQMAIDD 
FGGQVLGKPI SVIVGDHQLK PDIGGALARR WYDVEQVDLI VDVPVSAVGL AVQNIANEKK
RMFITQSTGA ADFHGKFCSP YTMQWVFDTR ALAVGTAQEV VKRGGDTWFF ITDDYAFGLS
LERDAAAVVT KAGGKVIGSV RPPFATPDLS SFVLQAQASK AKIIGIAGGP PNNINEIKTG
AEFGVFKGGQ QMAALLALIT DIHSLGLPAA QGLLLTTSFY WDMDDRTREW SKRYFAKMNR
MPTMWQAGVY SAVTHYLQGI KEAGTDEPLK VAAKMREKPI EDFFSRNGKL REDGLMVHDL
MLVQVKSPEE SKYPWDYYKI LAHISGEEAF GPPDPACPLI KKQAAN