Gene Synpcc7942_1714 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1714 
Symbol 
ID3775413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1782216 
End bp1783232 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content57% 
IMG OID637800152 
Producthypothetical protein 
Protein accessionYP_400731 
Protein GI81300523 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2021] Homoserine acetyltransferase 
TIGRFAM ID[TIGR01392] homoserine O-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTCG GCACGTTTAG ACTGCCGAAC TTTGAGCTGG ACTGCGGTGC TGTCCTTCCC 
GAGGCGAGCC TCGTTTACGC CACCTACGGA GAACTCAACC GCGATCGCAG CAATGCGATT
CTCTACCCGA CCTCCTACGG TGCTCAGCAT TCGACGATCG ACTGGCTGAT TGGAGGCGAT
CGCATCCTCG ATCCCGATCG CTGGTTCATC GTCATCGTCA ATCAGTTCGG GAATGGGCTT
TCCAGTTCTC CCAGCAACGA TCCGGCTTGC GGACTGGCAG AGCAGGGGTT TTGGTTCAGT
CATTGGGACA GTGTCTGTGC TCAACAGGCT CTCCTCAGCC AAGTACTGGG CATTGAGCAA
CTGGCGCTGA TCTACGGCTG GTCGATGGGG GCACAGCAGG CTTATCACTG GGCGATCGCT
TTCCCCGATC GCGTCCAGCG GATTGCAGCA CTCTGTGGAA CGGCAAAAAC GACCGAGCAT
AACCGGTTGT TTCTGGAGAG CCTTCGCGCT GCGTTGATCG CTGATCCAAC TTGGGATGGT
CAACGGTTTC AAGCCACTCC CGATCGCGGT TACAAAGCCT TTGCGCGGAT CTATGCCAGT
TGGGCTGCGT CTCAGGCCTT TTATCGAGCG GGTATTTACC GGCAGCAGGG CTACAGTTCG
CTAGAGGATT ATTTGGAACG GGGTTGGGAA GCGAACTATC GTCAGCGCGA TCCCCACGAT
CTACTGGCGA TGATCGACAC CTGGTTGCGC TGTGATGTCA GCGATCGCCC TGCTTTTGGG
GGTGATTTAG CCAAGGCACT CGGCAGCATT ACGGCTCAGA CCTTGGTCAT GCCCTCGACA
ACAGATCTCT ACTTCACCCC AGAGGATTGT GAGGCCGAAG CGCAGTTGAT TCCTAAGGCG
CACTATTGCC CAATTCCCTC GATCTGGGGT CACCGCGCGG GCAACCCCAG CCAAAATCCG
CAGGATGAAA GCTTCATTCG GCAGGCCGTT CAGGCTTTGC TCAACGCTGA AGCCTAG
 
Protein sequence
MTVGTFRLPN FELDCGAVLP EASLVYATYG ELNRDRSNAI LYPTSYGAQH STIDWLIGGD 
RILDPDRWFI VIVNQFGNGL SSSPSNDPAC GLAEQGFWFS HWDSVCAQQA LLSQVLGIEQ
LALIYGWSMG AQQAYHWAIA FPDRVQRIAA LCGTAKTTEH NRLFLESLRA ALIADPTWDG
QRFQATPDRG YKAFARIYAS WAASQAFYRA GIYRQQGYSS LEDYLERGWE ANYRQRDPHD
LLAMIDTWLR CDVSDRPAFG GDLAKALGSI TAQTLVMPST TDLYFTPEDC EAEAQLIPKA
HYCPIPSIWG HRAGNPSQNP QDESFIRQAV QALLNAEA