Gene Syncc9902_1068 details


Gene Information

Locus tag: Syncc9902_1068
Symbol:
ID: 3742491
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus sp. CC9902
Kingdom: Bacteria
Replicon accession: NC_007513
Strand:
Start bp: 1028920
End bp: 1030449
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 55%
IMG OID: 637771244
Product: threonine dehydratase
Protein accession: YP_377076
Protein GI: 78184641
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01124] threonine ammonia-lyase, biosynthetic, long form
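
The coordinates and lengths listed above are mutually consistent. The short sketch below checks the arithmetic, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1028920
end_bp = 1030449

# Assumption: 1-based, inclusive coordinates.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1530 509
```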


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGATT ACCTGCAAAA GATCCTCCGA GCACGTGTTT ACGACGTGGC CAGGGAAACC 
CCTCTTGATC CAGCTCCCAA CCTCAGCCAG CGGCTCAACA ATCAGATTTG GTTGAAACGA
GAAGACCTAC AGCCGGTGTT TTCGTTCAAG CTTCGAGGCG CCTACAACCG CATGGTGCAA
CTCCCCGTTG AGGATCTTCA ACGCGGCGTG ATCGCATCCA GTGCAGGCAA CCACGCCCAA
GGAGTGGCTC TCAGTGCGCG CCGTTTGGGT TGCAGGTCGG TGATTGTGAT GCCCAAAACA
ACTCCCGAAG TCAAAATCCG CGCCGTCCGT GCCTTAGGCG GCGAAGTGGT TTTGCACGGC
GAGACCTACG ACGAATGCTC GGCGGAAGCG CAAAAGCGTT GTGAAGCTGA AGGATTGACC
TACATCCACC CCTTCGACGA CCCAGAGGTC ATCGCTGGTC AAGGCACGAT TGGCATGGAA
ATCATGCGAC AAAGCAAAGA ACCACCGGAT GCGATTTACG TCGCCGTTGG TGGAGGGGGA
TTGATCGCCG GAATTTCGGC CTACGTGAAG CAGCTTTGGC CAGAAACAGA AGTGATCGGA
GTCGAACCCA TTGATGCCGA TGCGATGACC CAATCCCTGC GGTGCGGACA TCGCGTGGAG
TTGGAGAAAG TGGGTTTATT TGCCGATGGC GTCGCCGTTC GAAAGGTTGG TGAGCACACC
TTTGATCTGG CTCGTCGCTA TGTGGATCGC ATGGTGACCG TGGACACCGA TGCCATTTGT
GCCGCCATCA AAGACGTCTT TGAGGACACA CGATCAATCC TTGAACCAGC TGGAGCGCTT
GCGATCGCTG GTTTGAAACA GGATGTGGCT GATCGAAACC TCGAAGGGAA GCGTCTTGTT
GGGGTGGCCT GCGGGGCCAA CATGAATTTC GAACGATTGC GCTTCATCGC AGAGCGGGCT
GAGCTCGGTG AGGAGCGAGA AGCGATGTTT GCCGTGGAAA TCCCCGAGTC GACAGGAAGT
CTTCGTGCGT TGTGTCGATG TTTAAGCGAC CGCAGCCTCA CCGAATTCAG TTATCGAATG
ACGGAGGGAG TGCGCGCCCA AATCTTCATT GGAGTTCAGG TCAACAATCA GCAAGACCGC
TTCGCCCTCG TCGAGCATCT GTCCAACAAC GGGTTCCCCT GCCTTGATTT GAGCGACAAC
GAGTTTGCGA AAGTTCATTT GCGACACATG GTGGGTGGAC GACTACCGAA ATCAGCTCGC
GAGGCCTGCG CGGGGGACTG CAGCGAATTG CTTTATCGCT TCGAGTTTCC AGAACGACCC
GGTGCGCTGA TGAATTTTGT GACGTCGCTT CATCCCAGTT GGAGCATCAG CATTTTTCAC
TATCGAAACC ACGGCGCCGA CACCGGCCGA ATCGTGGTGG GCGTTTTGAT TCCTAAGACT
GAAATGGACG GTTGGACCAG CTTTCTAAGT GCGTTGGGTT ATGCCTTTTG GGAAGAAAGT
AAGAACCCTG CTTACAGCCT GTTCCTGTGA
 
Protein sequence
MTDYLQKILR ARVYDVARET PLDPAPNLSQ RLNNQIWLKR EDLQPVFSFK LRGAYNRMVQ 
LPVEDLQRGV IASSAGNHAQ GVALSARRLG CRSVIVMPKT TPEVKIRAVR ALGGEVVLHG
ETYDECSAEA QKRCEAEGLT YIHPFDDPEV IAGQGTIGME IMRQSKEPPD AIYVAVGGGG
LIAGISAYVK QLWPETEVIG VEPIDADAMT QSLRCGHRVE LEKVGLFADG VAVRKVGEHT
FDLARRYVDR MVTVDTDAIC AAIKDVFEDT RSILEPAGAL AIAGLKQDVA DRNLEGKRLV
GVACGANMNF ERLRFIAERA ELGEEREAMF AVEIPESTGS LRALCRCLSD RSLTEFSYRM
TEGVRAQIFI GVQVNNQQDR FALVEHLSNN GFPCLDLSDN EFAKVHLRHM VGGRLPKSAR
EACAGDCSEL LYRFEFPERP GALMNFVTSL HPSWSISIFH YRNHGADTGR IVVGVLIPKT
EMDGWTSFLS ALGYAFWEES KNPAYSLFL
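
The sequences above are printed in blocks of ten characters, so they need to be joined before use. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, its GC percentage computed, and its translation compared with the listed protein. It assumes the gene sequence is given in coding orientation and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence, copied from the record.
first_row = "ATGACCGATT ACCTGCAAAA GATCCTCCGA GCACGTGTTT ACGACGTGGC CAGGGAAACC"
print(translate(clean(first_row)))  # MTDYLQKILRARVYDVARET
```

Passing the full gene sequence through `clean` and `translate` and comparing the result with the cleaned protein sequence would check the whole record in the same way.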