Gene PCC8801_1472 details


Gene Information

Locus tag: PCC8801_1472
Symbol:
ID: 7103670
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 1544000
End bp: 1545046
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 38%
IMG OID: 643474547
Product: histidinol-phosphate aminotransferase
Protein accession: YP_002371684
Protein GI: 218246313
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
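
The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1544000, 1545046)
print(length, protein_length_aa(length))  # prints: 1047 348
```

Both values agree with the Gene Length and Protein Length fields of this record.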


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTACCGA TTCGAGAATG CGTTAGCCAA ACCCCAGGAT ATGTCCCTGG AGAACAACCC 
CAAACCACGG ATTATATTAA ACTTAATACC AATGAAAACC CCTATCCTCC CCCTGATAAA
ATTTTTGAGG GACTGCAACA AGAATTAACA AAAGTTAGAT TATATCCTGA TCCTGTTTCA
ACTCAATTAA GGAAAGCTGC TGCTAATGTT TTTGGTATTT CCTATCAGAA TATTTTAGCA
GGAAATGGCT CAGATGACAT TTTAAATATT GCAGTCAGAA CCTTTGTTAA TCCGGGGGAA
GTTGTCGCTT TTCTCGATTT AACCTATTCC TTGTATGAGA CGATCGCACG GGTTCATGGT
GCTTCTATTG TCCAAATTCC TACCAATAAT CAATTTGAAT TAAACGGACC GATTATTTGT
CCTGAGGCTA AACTAATTTT TGTCGCTTCT CCTAATCCTC CTGTGGGAAA ACACCTGAAC
CGAGACTATC TTGAAGAAAC CTGTAAACAG GCAACGGGAG TGGTATTAAT TGATGAAGCG
TATGTGGATT TTAGCGATGA AAATCATCTA GACTTTTTAG AAAAATACGA CAATGTTATC
ATTTCTCGTA CCATGTCTAA GAGTTATAGT TTAGCGGGAA TGCGAGTCGG TTTTGGGGTG
AGTTCAACGG AAATTATTGA ACAAATGGAT AAGGTAAGAG ATTCCTATAA TTTAGATAGA
ATCGCTCAAA CTTTAGGAAC AGCAGTATTA AATTATCAGG ACTATTTTAA AGGGGTTTGG
CAACAAGTTC GTCACACCCG TACTCGGTTA ATTGAATCTT TGCGAACCTT AGAGTTTTTG
GTGTTTGATT CTGATTCTAA TTTTGTGCTG GCATCTCCAC AATGGATAGC TGCATCGGAT
CTTTATACAC AGTTAAAAGA GAGAAAAGTC CTAGTCAGAT ATTTTAGTCA TCCTCGCATT
AAAGACTATG TTAGAATTTC CATTGGAACC GATCAAGAAA TTGATCGCTT ATTAGAAGCT
ATCCATGAAA TTAAAGGGAG TAACTAA
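
The gene sequence is displayed in space-separated blocks of 10 bases, so the blocks must be joined before any computation such as GC content. The sketch below shows one way to do that. The helper names are hypothetical, and only the first display line of the sequence is used as sample input; pasting the full sequence gives the value for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and newlines used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First display line of the gene sequence (six blocks of 10 bases).
first_line = "ATGTTACCGA TTCGAGAATG CGTTAGCCAA ACCCCAGGAT ATGTCCCTGG AGAACAACCC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```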
 
Protein sequence
MLPIRECVSQ TPGYVPGEQP QTTDYIKLNT NENPYPPPDK IFEGLQQELT KVRLYPDPVS 
TQLRKAAANV FGISYQNILA GNGSDDILNI AVRTFVNPGE VVAFLDLTYS LYETIARVHG
ASIVQIPTNN QFELNGPIIC PEAKLIFVAS PNPPVGKHLN RDYLEETCKQ ATGVVLIDEA
YVDFSDENHL DFLEKYDNVI ISRTMSKSYS LAGMRVGFGV SSTEIIEQMD KVRDSYNLDR
IAQTLGTAVL NYQDYFKGVW QQVRHTRTRL IESLRTLEFL VFDSDSNFVL ASPQWIAASD
LYTQLKERKV LVRYFSHPRI KDYVRISIGT DQEIDRLLEA IHEIKGSN
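
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the standard codon assignments, which translation table 11 shares for every sense and stop codon (table 11 differs from the standard table only in the set of permitted start codons, and this gene starts with ATG). The `translate` helper and the table construction are illustrative, not taken from the record's source database.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; display spaces are ignored and one trailing stop is dropped."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First display line of the gene sequence and the last five codons of the gene.
print(translate("ATGTTACCGA TTCGAGAATG CGTTAGCCAA ACCCCAGGAT ATGTCCCTGG AGAACAACCC"))
print(translate("AAAGGGAGTAACTAA"))
```

The two fragments translate to the first 20 residues (MLPIRECVSQ TPGYVPGEQP) and the last four residues (KGSN) of the protein sequence listed above.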