Gene Information |
Locus tag | RPD_1257 |
Symbol | |
ID | 4021734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 1419930 |
End bp | 1420970 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637961450 |
Product | aminotransferase, class I and II |
Protein accession | YP_568396 |
Protein GI | 91975737 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01140] L-threonine-O-3-phosphate decarboxylase |
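
The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; neither convention is stated in the record. The function names `span_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information rows above.
start_bp = 1419930
end_bp = 1420970
gene_length_bp = 1041
protein_length_aa = 346


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the ones listed in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(protein_length(gene_length_bp) == protein_length_aa)
```

Under those assumptions, both comparisons agree with the listed Gene Length and Protein Length.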
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.644499 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACGGCAG CGGCGGCTGC GACGGCGCTC CGCATTCACG GCGGCCGCGT CGATCTCGCA GCAAGCGCCT ATCCTGACGC GCCGCAGCCC TGGATCGATC TTTCGACCGG CATCAATCCG ACCGCCTATC CGATCCCGAC GCTCGCAGCA GCGGCCTTTG CACGGCTGCC GCTGACGACG GAACTCGACG AATTATGCGC GGCCGCGGCC GAAGCCTATG GGCTGCCCGG CGGCGCAGTG GTGCTTCCCG CGCCGGGCAG CGAGATCGCA ATCCGGCTGG CGCCGCTCGT GCTGACGCCG CCACAGCCAT CGATCGCGCA AGTCGGCATC CTGGGACCGA CCTATGGCTC GCACGCCGCC GCCTGGCGCG CGGCAGGCGC GCAGGTGCAT GAGCTCGGCG CCCTGCCCGA TCCGCAGGCG CACTTCGATG TCGTGGTGCT CGTCAACCCG AACAATCCGG ATGGACACCT GATCGCGCCG GAGCCACTTG CGGACTTTGC CGAGTGCTGG ACCGCATCGG GCAAACGCCT GGTGATCGAC GAAGCGTTCG GCGACCTCAG GCCGGAATTG TCGGTGCTTG GCGGCGCGGC GTTGCCGCGC GGGGTGGTGG TGCTGCGGTC GCTCGGAAAA TTCTTCGGGC TGGCAGGACT GCGGCTCGGC TTCGTGGTGG TCAATGCCGG CGATGCGCCG CGATGGCGTG ATCTGCTCGG CGACTGGGCC GCGTCCGGAC CGGCCTGCAC GATCGCAACC GCTGCGCTGC GCGACAAGGC ATGGATCGCG GCGACGCGCA GCCGGCTTGC CGCCGACCGC GGCCGGCTCG ACGCAACACT CGCCGCAGCG CAGCTCGAAC CGCGCGGCGG AACCGATCTG TTCGGGTTCT ACGAAAGCGC GGACGACGGC GATCTGGTCG ACCGGTTCGC GCGCGCCGGC ATCTTGATCC GCGGCTTCGA TCACAGCCCA CGGCTTTATC GCTTCGGCCT GCCCGCGGAC GAGCCCGCCT GGCAACGGCT GCAGCGGATC TGCGATACTG TTGCGGGCTA G
|
Protein sequence | MTAAAAATAL RIHGGRVDLA ASAYPDAPQP WIDLSTGINP TAYPIPTLAA AAFARLPLTT ELDELCAAAA EAYGLPGGAV VLPAPGSEIA IRLAPLVLTP PQPSIAQVGI LGPTYGSHAA AWRAAGAQVH ELGALPDPQA HFDVVVLVNP NNPDGHLIAP EPLADFAECW TASGKRLVID EAFGDLRPEL SVLGGAALPR GVVVLRSLGK FFGLAGLRLG FVVVNAGDAP RWRDLLGDWA ASGPACTIAT AALRDKAWIA ATRSRLAADR GRLDATLAAA QLEPRGGTDL FGFYESADDG DLVDRFARAG ILIRGFDHSP RLYRFGLPAD EPAWQRLQRI CDTVAG
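
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch of how the blocks might be cleaned and a GC fraction computed for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks of the gene sequence are used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used as a small sample.
sample = "TTGACGGCAG CGGCGGCTGC"
print(clean_sequence(sample))
print(gc_fraction(sample))
```

Passing the full Gene sequence value to `gc_fraction` instead of `sample` would give a figure to compare against the listed GC content.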