Gene RPD_2999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2999 
Symbol 
ID4023502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3340256 
End bp3341482 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content63% 
IMG OID637963198 
Productputative urea/short-chain binding protein of ABC transporter 
Protein accessionYP_570126 
Protein GI91977467 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0873398 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.403324 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGTAG TCTTCTCGCG TCGTCATGCC GTGGCTATCG CGCTGGCAAG CGCCGCGTTT 
ACCTCGCCGG TACTCGCTCA GGACAAGACC GCAAAAATCG GCGTGCTCAA CGACATGTCG
AGCCTCTACG CCGATATCGG CGGACCGAAC TCGGTCGTCT CCGCCAAGCT GGCGATCGCT
GACTCCGGTC TTGAGGCGAA GGGCTGGAAG ATCGAGCTGC TCGCCGGCGA TCATCAGAAC
AAGCCGGACA TCGGCGTCAA CGTCGCCCGT CAGTGGATTG ACGTCGACAA GGTTGACTTG
ATCACTGACA CGCCGAACTC GGGCGTGGCG CTCGCGATCA GCAATCTGGT CAAGGAAAAG
AACAGCATCC TGATGAATTC AGGCGGCGCC AGCGCCGACC TGACGGGCAA GCAGTGCACC
CCCAACACCA TCTCGATGAC TTACGACACC TACATGCTGG CGCACGGCAC CGGTCAGGCG
CTGACCAAGG CCGGCGGCAA TAGCTGGTTC TTCCTGACCG CGGATTACGC GTTCGGAGCG
GCGCTCGAGC GCGATACGAC GGCGGTGGTC AAAGCCAATG GCGGCCAGGT GCTCGGCGGC
GTCAAACACC CGCTCAACAC CGCCGACTTC TCGTCGTTCC TGCTGCAGGC GCAGGCGTCC
AAGGCCAAGG TCATCGGCCT CGCCAATGCC GGCGGCGACA CCACCAACTC GATCAAGCAG
GCTTCGGAGT TCGGCATCAC CGCCGGCGGG CAGAAGCTCG CGGCGCTGCT GCTGTTCGTC
AACGACGTCC ACTCGCTCGG CCTCAAGGTC GCGCAGGGAC TGACCTTCAC CGAGTCCTAC
TACTGGGATC TCAACGATAA TACCCGCGCC TTCGCCAAGC GCTTCTCGGA GCAGTCCAAG
AACAACGCCA AGCCGTCGAT GACCCAGGCT GGCGTCTATG CCGCCGTGCT GCATTATCTC
AAGACGCTCG ACGCGATGGG CGGCAACCCA CACGACGGCG CCAAGGTTGT AGCCAAGATG
AAGGAGATCC CGGCCGACGA CGTGCCGTTC GGTAAGTCGG TTATCCGCGC CGACGGCCGC
CGCCTGGTTC CGGCCTATCT GTTCGAGGTG AAGTCGCCCG CCGAGTCAAA GGGGCCGTGG
GACTACTACA AGAAGATCGC GGACATCTCC GCCGAGGATG CGGCCCGTCC GTTGTCCGAG
AGCGAATGCC CGCTGGTGAA GAAATAA
 
Protein sequence
MRVVFSRRHA VAIALASAAF TSPVLAQDKT AKIGVLNDMS SLYADIGGPN SVVSAKLAIA 
DSGLEAKGWK IELLAGDHQN KPDIGVNVAR QWIDVDKVDL ITDTPNSGVA LAISNLVKEK
NSILMNSGGA SADLTGKQCT PNTISMTYDT YMLAHGTGQA LTKAGGNSWF FLTADYAFGA
ALERDTTAVV KANGGQVLGG VKHPLNTADF SSFLLQAQAS KAKVIGLANA GGDTTNSIKQ
ASEFGITAGG QKLAALLLFV NDVHSLGLKV AQGLTFTESY YWDLNDNTRA FAKRFSEQSK
NNAKPSMTQA GVYAAVLHYL KTLDAMGGNP HDGAKVVAKM KEIPADDVPF GKSVIRADGR
RLVPAYLFEV KSPAESKGPW DYYKKIADIS AEDAARPLSE SECPLVKK