Gene RPD_1566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1566 
Symbol 
ID4022046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1757378 
End bp1758658 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content65% 
IMG OID637961761 
Productputative branched-chain amino acid ABC transport system substrate-binding protein 
Protein accessionYP_568704 
Protein GI91976045 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.293644 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.470298 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGA AAACCCTACT GAGCACGGCG TCGCTGGCGC TCGTGATCGC AACCGTCTCG 
GCGACCGCGC AGGCGCAGAT CGCGATCGGG CATCTCGCCG ACTATTCGGG CGGCACCTCT
GATGTCGGCA CGCCCTACGG CCAGGCCGTC GCCGACACCT TCGCATGGGT CAACAAGAAC
GGCGGCGTCG CCGGCAAGCA GCTCAATGTC GACAGCAACG ACTACGGCTA CCAGGTGCCG
CGCGCGATCG CGCTGTACAA GAAGTGGTCC GGCGGCGACA AAGTCGCAGC GATCATGGGC
TGGGGCACTG CCGACACCGA AGCGCTGACC GGCTTCCTCG CCCAGGATAA AATCCCCGAC
ATGTCCGGCT CCTACGCCGC CGCTTTGACC GATCCGGAAG GCACCAGCGG CAAGGCCAAG
CCCGCCCCCT ACAACTTCTT CTACGGCCCG AGCTATTCCG ACGCGCTGCG CGCCGAGCTG
ATGTGGGCGG CGGAGGACTG GAAGGCCAAG GGTAAGTCGG GCGCTCCGAA ATTCGTGCAT
ATGGGCGCCA ACCATCCCTA CCCCAACGCG CCGAAGGCTG CAGGCGAAGC CCTCGCCAAG
GAGCTCGGCT TCGAGGTGTT GCCGCCGCTG GTGTTCGCGC TGGCGCCGGG CGACTACAGC
GCCCAGTGTC TGAGCCTGAA GAGCTCGGGC GCCAACTACG CCTATCTCGG CAACACCGCG
GCCTCCAACA TCTCGGTGAT GAAGGCCTGC AAAGCGGCCG GGGTCGACGT GCAGTTCATG
AGCAACGTGT GGGGCATGGA CGAGAACGCC GCCAAGACCG CCGGCGACGC CGCCGATGGC
GTGATCTTCC CGCTGCGCAC CGCGGTCGCC TGGGGCGGCA ATGCGCCCGG CATGAAGACT
GTCGAGGAAG TCTCCAAGAT CTCCGACTCG ACCGGCAACG TCTATCGTCC GGTGCATTAC
GTCGCGGCGG TGTGCTCCGC GCTGTACATG AAGGAGGCGA TCGAATGGGC CGCCAAGAAC
GGCGGCGCCA CCGGCGAGAA CGTCGCCAAG GGCTTCTACC AGAAGAAGGA TTGGGTGCCG
GCCGGCATGG ACGGCGTCTG CAATCCCTCG ACCTGGACCG ACAAGGACCA CCGCGGAACG
ATGAAGGTCG ACCTGTATCG GTCGAAAGTG ACGGGACCGA CCGATGGCGA CATCAAGGAC
CTGATCGCCA AGGGCACGAT CAAGCTCGAG AAGGTCAAGA CCGTCGACCT GCCGCGCAAG
CCGGAATGGT TCGGCTGGTG A
 
Protein sequence
MTMKTLLSTA SLALVIATVS ATAQAQIAIG HLADYSGGTS DVGTPYGQAV ADTFAWVNKN 
GGVAGKQLNV DSNDYGYQVP RAIALYKKWS GGDKVAAIMG WGTADTEALT GFLAQDKIPD
MSGSYAAALT DPEGTSGKAK PAPYNFFYGP SYSDALRAEL MWAAEDWKAK GKSGAPKFVH
MGANHPYPNA PKAAGEALAK ELGFEVLPPL VFALAPGDYS AQCLSLKSSG ANYAYLGNTA
ASNISVMKAC KAAGVDVQFM SNVWGMDENA AKTAGDAADG VIFPLRTAVA WGGNAPGMKT
VEEVSKISDS TGNVYRPVHY VAAVCSALYM KEAIEWAAKN GGATGENVAK GFYQKKDWVP
AGMDGVCNPS TWTDKDHRGT MKVDLYRSKV TGPTDGDIKD LIAKGTIKLE KVKTVDLPRK
PEWFGW