Gene Information |
Locus tag | RPD_3864 |
Symbol | |
ID | 4024380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4302470 |
End bp | 4303732 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637964068 |
Product | amide-urea binding protein |
Protein accession | YP_570986 |
Protein GI | 91978327 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR03407] urea ABC transporter, urea binding protein |
|
|
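The Start bp, End bp, Gene Length and Protein Length fields above agree with one another, and the arithmetic can be checked directly. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention) and that the protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon encodes one residue; the stop codon encodes none.
    return gene_len_bp // 3 - 1


# Values copied from the record for RPD_3864.
length_bp = gene_length(4302470, 4303732)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record the sketch gives 1263 bp and 420 aa, matching the Gene Length and Protein Length fields.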
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.601855 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGACA AGAAAAAGCA GGGCCTTCAC TCGCCGTTCC GACGCAAGCT TCTGATGGGC ATGGCCGCGA TCCCGGCGAT GTCGCTGCTG CCGCGAACAT CGTTCGCCCA GGCGCCGGCG ACTTCCGTCG TCAACACCAC CGGCCTGGCA GTGACCGATA CGGAGGTCAC TGTCGGAATC CTGCACTCGG TCACGGGCAC GATGGCGATC TCGGAGACCG GTTCGGTGCA GGCCGAGAAG CTGGCGATCG AGCAGATCAA CGCCGCCGGC GGCGTGCTCG GCCGCAAGAT CAAGTTCATC CAGGAAGACG GCGCGTCCGA TTGGCCGAAT TTCGCCGAGA AGGCCAAGAA GCTTCTGGTC AACGACAAAT GCGCCGCGGT GATGGGCTGC TGGACCTCGG CCTCGCGCAA GGCGGTGCTG CCGGTGTTCG AGCAATACAA CGGCATGTTG TACTACCCGA CCTTCTACGA AGGCCTGGAG CAGTCCAAGA ACGTCATCTA CACCGGCCAG GAGGCCACCC AGCAGATCAT CGCCGGCCTC GATTGGGTCA ACAAGACCAA GGGCGCCAAG AGCTTCTATC TGCTCGGCTC GGACTACATC TGGCCGCGCA CCTCCAACAA GATCGCGCGC AAGCACATCG AAAGCCATCT GAAGGACGCC AAGGTGGTCG GCGAGGAGTA CTTCCCGCTC GGTCACACCC AATTCAACTC GGTGATCAAC AAGATCAAGC TCACCAAGCC GGACGTGATC TACGCGATCA TCGTCGGCGG TTCGAATGTC GCGTTCTACA AGCAGCTCAA GGCGGCCGGC ATCGACCTGT CGAAGCAGAC GCTGTTGACG ATCTCGGTCA CCGAGGACGA GATCGACGGC ATCGGCGGCG AGAACATCGC GGGAGCCTAT GCCTGCATGA AGTACTTCCA GTCGCTCGAC AATCCGAACA ACAAGGAATT CGTCGCCGCA TTCAAGAAGA TGTGGGGCGA GAAGACTGTG ATCGGAGACG TCACCCAGGC TGCCTATCTC GGCCCGTGGC TGTGGAAGTT GACCGTGGAG AAGGCCGGCT CGTTCGACGT CGACAAGGTG GCGGCCGCGT CGCCGGGCGT GGAATTCAAG GGCGCGCCGG AAGGCTACGT TCGGGTCCAC GAGAATCACC ACCTCTGGTC GAAGACCAGG GTCGGTCGCG CCAAGCTCGA TGGCCAGTAC GAACTGGTCT ACGAGACCGC CGATCTGGTC GAACCGGACC CGTTCCCGAA GGGCTATCAG TAA
|
Protein sequence | MSDKKKQGLH SPFRRKLLMG MAAIPAMSLL PRTSFAQAPA TSVVNTTGLA VTDTEVTVGI LHSVTGTMAI SETGSVQAEK LAIEQINAAG GVLGRKIKFI QEDGASDWPN FAEKAKKLLV NDKCAAVMGC WTSASRKAVL PVFEQYNGML YYPTFYEGLE QSKNVIYTGQ EATQQIIAGL DWVNKTKGAK SFYLLGSDYI WPRTSNKIAR KHIESHLKDA KVVGEEYFPL GHTQFNSVIN KIKLTKPDVI YAIIVGGSNV AFYKQLKAAG IDLSKQTLLT ISVTEDEIDG IGGENIAGAY ACMKYFQSLD NPNNKEFVAA FKKMWGEKTV IGDVTQAAYL GPWLWKLTVE KAGSFDVDKV AAASPGVEFK GAPEGYVRVH ENHHLWSKTR VGRAKLDGQY ELVYETADLV EPDPFPKGYQ
|
| |
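The relationship between the Gene sequence and Protein sequence fields can be spot-checked by translating codons with the genetic code named in the Translation table field (table 11, which assigns the same amino acids to codons as the standard code). The following is a minimal sketch, assuming the gene sequence is listed on the coding strand (it begins with ATG and ends with TAA, although the Strand field is "-"). The helper name `translate` and the constants are illustrative, not part of any library.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout, first base slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces used to group the sequence into blocks of ten.
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three ten-base blocks and the last twelve bases of the gene sequence.
first_blocks = "ATGTCAGACA AGAAAAAGCA GGGCCTTCAC"
last_codons = "GGCTATCAGTAA"
print(translate(first_blocks))
print(translate(last_codons))
```

The first call yields MSDKKKQGLH and the second yields GYQ, which match the first ten and last three residues of the listed protein sequence. Pasting the full Gene sequence value into `translate` extends the same check to the whole record.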