Gene Rsph17029_0959 details

Gene Information

Locus tag: Rsph17029_0959
Symbol:
ID: 4897041
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 989136
End bp: 990221
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 68%
IMG OID: 640111545
Product: histidinol-phosphate aminotransferase
Protein accession: YP_001042842
Protein GI: 126461728
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
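
The length fields follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon (both are assumptions, although they agree with the values listed above), the arithmetic can be checked as follows. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 989136
end_bp = 990221

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1086 361"
```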


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00272091
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACG CCATACGCCC CCAGCCCGGG ATTCTCGACA TTGCCCTCTA CGAGGGCGGC 
AAGAGCCATG TGGCGGGCAT CCAGAACGCG CTGAAGCTGT CGTCGAACGA GAACCCGTTC
GGCCCCTCGC CCAAGGCGAA GGAGGCTTTC CTGCGCTCGG TCCATACACT GCACCGCTAT
CCCTCGACCG ACCATGCGGG CCTGCGCCAT GCGATCGCCG AGGTGCACGG GCTCGATCCC
GCCCGCGTGA TCTGCGGCGT GGGCTCGGAC GAGATCATCA CCTTCCTGTG CCAGGCCTAT
GCCGGGCCGC ACACGGATGT CGTCTTCACC GAGCACGGCT TCCTCATGTA CCGGATCTCG
GCCCTGGCGG TCGGGGCCAA TCCGGTCGAG GTGCCCGAGC GCGAGCGCAC GACCGACGTG
GATGCGATCC TCGCCGCCTG CACGCCGCAC ACGCGGCTGG TGTTCCTCGC CAACCCCAAC
AACCCGACGG GCACCATGAT CGGGCAGGCC GATCTCGCGC GGCTGGCCGC GGGGCTGCCG
GCGCAGGCGA TCCTCGTGCT CGACGGGGCC TATGCCGAAT ATGTGCCGGG CTATGACGCG
GGCCGCGCCC TGATCGAGGA GCGCGGCAAC GTCGTCATGA CGCGGACCTT CTCGAAGATC
TACGGGCTGG GCGGGCTGCG CGTGGGCTGG GGTTACGGGC CGAAAGCCAT CATCGACGTG
CTGAACCGGA TCCGGGGGCC CTTCAACCTC TCCACCACAC AGCTCGAGAC CGCCGAGGCC
GCGGTGCGCG ATCAGGACCA TGTCGCCCGC TGCCGCGCCG ACAATGCGCG CTGGCGCATC
TGGCTGGCCG AAGCGCTGGC GGAAATCGGC GTGCCGTCCG ATACATCGAT GGCGAACTTC
ATCCTCGCCC GCTTCTCGGA TACCGAGGAG GCCGAGGCCT GCGACCTCCA TCTGCAGACG
CAGGGGCTGA TCGTGCGCCG CGTCGCGGGC TACAAGCTGC CGCACTGCCT GCGCATCACC
ATCGGCGACG AGGCCTCCTG CCGCCGCGTC GCCCATGCGA TCGGCCAGTT CAAGAGGATG
CGCTGA
 
Protein sequence
MSDAIRPQPG ILDIALYEGG KSHVAGIQNA LKLSSNENPF GPSPKAKEAF LRSVHTLHRY 
PSTDHAGLRH AIAEVHGLDP ARVICGVGSD EIITFLCQAY AGPHTDVVFT EHGFLMYRIS
ALAVGANPVE VPERERTTDV DAILAACTPH TRLVFLANPN NPTGTMIGQA DLARLAAGLP
AQAILVLDGA YAEYVPGYDA GRALIEERGN VVMTRTFSKI YGLGGLRVGW GYGPKAIIDV
LNRIRGPFNL STTQLETAEA AVRDQDHVAR CRADNARWRI WLAEALAEIG VPSDTSMANF
ILARFSDTEE AEACDLHLQT QGLIVRRVAG YKLPHCLRIT IGDEASCRRV AHAIGQFKRM
R
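
The sequences above are printed in space-separated blocks of ten residues, so they need to be cleaned before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of the source record, and the demo uses only the first line of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above, used only as a short demonstration.
fragment = clean_sequence(
    "ATGAGTGACG CCATACGCCC CCAGCCCGGG ATTCTCGACA TTGCCCTCTA CGAGGGCGGC"
)
print(len(fragment))                    # number of bases in the fragment
print(round(gc_fraction(fragment), 2))  # GC fraction of this fragment only
```

Passing the complete gene sequence through `clean_sequence` should yield a string whose length matches the Gene Length field, and `gc_fraction` on that string can be compared with the GC content field.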