Gene Synpcc7942_1030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1030 
Symbol 
ID3773960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1044888 
End bp1046009 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content56% 
IMG OID637799452 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_400047 
Protein GI81299839 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCCGT TTTTGCGTTC TGAGCTTGCT CGTTGCCAGC CCTACCACCC CAATCCGGGC 
GGGACTGGTA TGGCGATGGA CATTCTCGAC ACTAATGAGT GTCCCTATGA TCTACCGACC
GATCTCAAGC AAACGCTGGC AGATCGCTAT GTGGAGGCGA TCGCGTCCAA CCGCTATCCC
GATGGCAGCC ACACGGATCT CAAGGCGGCG ATCGTCGACT ACTTGAGTGA GCAAACCGCT
GGGCAATGGC AACCGGGGCC AGAGCACGTC ACTGTGGGCA ACGGCTCCGA TGAGCTGATT
CGCTCGATCT TGATTGCCAC GTGCCTGGGT GGACAAGGCT CGGTCTTGGT GGCGGAGCCA
ACCTTCTCGA TGTACGGGAT TGTGGCAGAG ACTTTGGGGA TTCCTGTAGT GCGGATCGGC
CGCGATCCCC AAACTTGGGA GATGGATCTG GCGGCGGCCG AAACTGCGAT TACCCAAACG
GAGGGCACGC CAGTTCGCCT CTGTTTCGTT GTCCATCCCA ACTCACCCAC CGCAAACCCG
CTGACGGAAG CGGAAAAAGA CTGGCTGCGC CAAGTCCCGC CCCAGATTTT AGTGGTAATT
GATGAGGCCT ATTTTGAATT CAGCGGCGAA ACCCTGCTGG CTGAGTTACC CCAACACCCC
AACTGGCTGA TTACCCGTAC CTTTTCTAAA GCGTTGCGAC TGGCCGCCCA TCGTGTTGGC
TACGGCATTG GGGATCCCCA ACTGATTGCT GCCCTCGAAG CAATTCGGTT ACCCTACAAT
CTGCCAAGCG TGGCTCAATT GGCAGCAACT CTTGCCCTCG AGGCGCGATC GCAACTACTC
TCAGCCATCC CTAGATTGAT CACTGAACGC GATCGCCTCT ATCGAAAACT GCAAGTCGTT
TCCCAGCTTC AAGTCTGGCC TAGCGCTAGC AATTTCCTAT TTTTAAAAAC GCAGTCTTCC
TCACAAACTG CAGCGTTAGC CGCACAACTC AAAGCTCAGG GAACGTTGGT GCGCCACACC
GCTGACGGAC TGCGAATTAC GATTGGTAGC CCAGCAGAAA ATGAGCGGAC TTTAGCTCAT
CTGCAAACAG CAATCACTCA ATCATTACCA GCGACGGTCT AG
 
Protein sequence
MLPFLRSELA RCQPYHPNPG GTGMAMDILD TNECPYDLPT DLKQTLADRY VEAIASNRYP 
DGSHTDLKAA IVDYLSEQTA GQWQPGPEHV TVGNGSDELI RSILIATCLG GQGSVLVAEP
TFSMYGIVAE TLGIPVVRIG RDPQTWEMDL AAAETAITQT EGTPVRLCFV VHPNSPTANP
LTEAEKDWLR QVPPQILVVI DEAYFEFSGE TLLAELPQHP NWLITRTFSK ALRLAAHRVG
YGIGDPQLIA ALEAIRLPYN LPSVAQLAAT LALEARSQLL SAIPRLITER DRLYRKLQVV
SQLQVWPSAS NFLFLKTQSS SQTAALAAQL KAQGTLVRHT ADGLRITIGS PAENERTLAH
LQTAITQSLP ATV