Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0959 |
Symbol | |
ID | 4897041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 989136 |
End bp | 990221 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640111545 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_001042842 |
Protein GI | 126461728 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00272091 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACG CCATACGCCC CCAGCCCGGG ATTCTCGACA TTGCCCTCTA CGAGGGCGGC AAGAGCCATG TGGCGGGCAT CCAGAACGCG CTGAAGCTGT CGTCGAACGA GAACCCGTTC GGCCCCTCGC CCAAGGCGAA GGAGGCTTTC CTGCGCTCGG TCCATACACT GCACCGCTAT CCCTCGACCG ACCATGCGGG CCTGCGCCAT GCGATCGCCG AGGTGCACGG GCTCGATCCC GCCCGCGTGA TCTGCGGCGT GGGCTCGGAC GAGATCATCA CCTTCCTGTG CCAGGCCTAT GCCGGGCCGC ACACGGATGT CGTCTTCACC GAGCACGGCT TCCTCATGTA CCGGATCTCG GCCCTGGCGG TCGGGGCCAA TCCGGTCGAG GTGCCCGAGC GCGAGCGCAC GACCGACGTG GATGCGATCC TCGCCGCCTG CACGCCGCAC ACGCGGCTGG TGTTCCTCGC CAACCCCAAC AACCCGACGG GCACCATGAT CGGGCAGGCC GATCTCGCGC GGCTGGCCGC GGGGCTGCCG GCGCAGGCGA TCCTCGTGCT CGACGGGGCC TATGCCGAAT ATGTGCCGGG CTATGACGCG GGCCGCGCCC TGATCGAGGA GCGCGGCAAC GTCGTCATGA CGCGGACCTT CTCGAAGATC TACGGGCTGG GCGGGCTGCG CGTGGGCTGG GGTTACGGGC CGAAAGCCAT CATCGACGTG CTGAACCGGA TCCGGGGGCC CTTCAACCTC TCCACCACAC AGCTCGAGAC CGCCGAGGCC GCGGTGCGCG ATCAGGACCA TGTCGCCCGC TGCCGCGCCG ACAATGCGCG CTGGCGCATC TGGCTGGCCG AAGCGCTGGC GGAAATCGGC GTGCCGTCCG ATACATCGAT GGCGAACTTC ATCCTCGCCC GCTTCTCGGA TACCGAGGAG GCCGAGGCCT GCGACCTCCA TCTGCAGACG CAGGGGCTGA TCGTGCGCCG CGTCGCGGGC TACAAGCTGC CGCACTGCCT GCGCATCACC ATCGGCGACG AGGCCTCCTG CCGCCGCGTC GCCCATGCGA TCGGCCAGTT CAAGAGGATG CGCTGA
|
Protein sequence | MSDAIRPQPG ILDIALYEGG KSHVAGIQNA LKLSSNENPF GPSPKAKEAF LRSVHTLHRY PSTDHAGLRH AIAEVHGLDP ARVICGVGSD EIITFLCQAY AGPHTDVVFT EHGFLMYRIS ALAVGANPVE VPERERTTDV DAILAACTPH TRLVFLANPN NPTGTMIGQA DLARLAAGLP AQAILVLDGA YAEYVPGYDA GRALIEERGN VVMTRTFSKI YGLGGLRVGW GYGPKAIIDV LNRIRGPFNL STTQLETAEA AVRDQDHVAR CRADNARWRI WLAEALAEIG VPSDTSMANF ILARFSDTEE AEACDLHLQT QGLIVRRVAG YKLPHCLRIT IGDEASCRRV AHAIGQFKRM R
|
| |