Gene RPD_3864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3864 
Symbol 
ID4024380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4302470 
End bp4303732 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content61% 
IMG OID637964068 
Productamide-urea binding protein 
Protein accessionYP_570986 
Protein GI91978327 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR03407] urea ABC transporter, urea binding protein 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.601855 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGACA AGAAAAAGCA GGGCCTTCAC TCGCCGTTCC GACGCAAGCT TCTGATGGGC 
ATGGCCGCGA TCCCGGCGAT GTCGCTGCTG CCGCGAACAT CGTTCGCCCA GGCGCCGGCG
ACTTCCGTCG TCAACACCAC CGGCCTGGCA GTGACCGATA CGGAGGTCAC TGTCGGAATC
CTGCACTCGG TCACGGGCAC GATGGCGATC TCGGAGACCG GTTCGGTGCA GGCCGAGAAG
CTGGCGATCG AGCAGATCAA CGCCGCCGGC GGCGTGCTCG GCCGCAAGAT CAAGTTCATC
CAGGAAGACG GCGCGTCCGA TTGGCCGAAT TTCGCCGAGA AGGCCAAGAA GCTTCTGGTC
AACGACAAAT GCGCCGCGGT GATGGGCTGC TGGACCTCGG CCTCGCGCAA GGCGGTGCTG
CCGGTGTTCG AGCAATACAA CGGCATGTTG TACTACCCGA CCTTCTACGA AGGCCTGGAG
CAGTCCAAGA ACGTCATCTA CACCGGCCAG GAGGCCACCC AGCAGATCAT CGCCGGCCTC
GATTGGGTCA ACAAGACCAA GGGCGCCAAG AGCTTCTATC TGCTCGGCTC GGACTACATC
TGGCCGCGCA CCTCCAACAA GATCGCGCGC AAGCACATCG AAAGCCATCT GAAGGACGCC
AAGGTGGTCG GCGAGGAGTA CTTCCCGCTC GGTCACACCC AATTCAACTC GGTGATCAAC
AAGATCAAGC TCACCAAGCC GGACGTGATC TACGCGATCA TCGTCGGCGG TTCGAATGTC
GCGTTCTACA AGCAGCTCAA GGCGGCCGGC ATCGACCTGT CGAAGCAGAC GCTGTTGACG
ATCTCGGTCA CCGAGGACGA GATCGACGGC ATCGGCGGCG AGAACATCGC GGGAGCCTAT
GCCTGCATGA AGTACTTCCA GTCGCTCGAC AATCCGAACA ACAAGGAATT CGTCGCCGCA
TTCAAGAAGA TGTGGGGCGA GAAGACTGTG ATCGGAGACG TCACCCAGGC TGCCTATCTC
GGCCCGTGGC TGTGGAAGTT GACCGTGGAG AAGGCCGGCT CGTTCGACGT CGACAAGGTG
GCGGCCGCGT CGCCGGGCGT GGAATTCAAG GGCGCGCCGG AAGGCTACGT TCGGGTCCAC
GAGAATCACC ACCTCTGGTC GAAGACCAGG GTCGGTCGCG CCAAGCTCGA TGGCCAGTAC
GAACTGGTCT ACGAGACCGC CGATCTGGTC GAACCGGACC CGTTCCCGAA GGGCTATCAG
TAA
 
Protein sequence
MSDKKKQGLH SPFRRKLLMG MAAIPAMSLL PRTSFAQAPA TSVVNTTGLA VTDTEVTVGI 
LHSVTGTMAI SETGSVQAEK LAIEQINAAG GVLGRKIKFI QEDGASDWPN FAEKAKKLLV
NDKCAAVMGC WTSASRKAVL PVFEQYNGML YYPTFYEGLE QSKNVIYTGQ EATQQIIAGL
DWVNKTKGAK SFYLLGSDYI WPRTSNKIAR KHIESHLKDA KVVGEEYFPL GHTQFNSVIN
KIKLTKPDVI YAIIVGGSNV AFYKQLKAAG IDLSKQTLLT ISVTEDEIDG IGGENIAGAY
ACMKYFQSLD NPNNKEFVAA FKKMWGEKTV IGDVTQAAYL GPWLWKLTVE KAGSFDVDKV
AAASPGVEFK GAPEGYVRVH ENHHLWSKTR VGRAKLDGQY ELVYETADLV EPDPFPKGYQ