Gene PCC8801_4028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4028 
Symbol 
ID7103507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4221007 
End bp4222305 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content48% 
IMG OID643477023 
Productthreonine synthase 
Protein accessionYP_002374123 
Protein GI218248752 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCAGG CAACTCAAAC CCAAACCAAA GCCGCCACCT TTACTCATCT TGTTTCTAAA 
GAAGGTGGCG TTAAATATCC CCTCAAAGCC CTTCACGTTT GCGAGGAAAC CTTCTCTCCC
CTGGAAGTGG CCTATGATTA CGATGCCATC CGTGCCCAAG TGACCCGCGA AAGCATCCAA
GCTGGACCCA ACTCCATTTG GCGTTACAAA GCGTTTTTGC CCGTAGAAAG CGAAAATCCC
ATTGATGTTG GCACCGGGAT GACCCCCCTC GTTAAATCGC ACCGTTTAGC CCGTCGCCTG
GGTCTAAAAA ATCTTTATAT CAAAAACGAT GCCGTCAATA TGCCCACCCT CAGCTTCAAA
GATAGGGTGG TGTCCGTTGC TCTCACTAGA GCCAAAGAAC TGGGATTTAC CACCGTTTCC
TGCGCCAGTA CGGGGAATTT AGCCAATTCT ACAGCAGCGA TCGCCGCCCA TGCAGGGTTA
GACTGTTGCG TGTTCATTCC GGCAGATTTA GAAGCGGGTA AAGTCCTGGG TACTCTCATC
TACAATCCGA CCGTGATGGC CGTCAAAGGG AACTACGACC AAGTGAACCG TCTCTGCTGC
GAAGTGGGTA ACAGCTACGG ATGGGGCTTT GTTAACATCA ATTTACGTCC CTACTACTCG
GAAGGGTCAA AAACGCTAGG ATTTGAAGTG GCCGAACAAT TAGGGTGGAA ACTCCCTGAT
CACGTCGTTG CTCCCTTAGC GTCGGGTTCC CTCTACACCA AGATTTACAA AGGCTTCCAA
GAGTTCATCA AAACCGGGTT AGTCGAAGAT AAAGCGGTTC GGTTCAGTGG AGCCCAAGCG
GAAGGTTGTT CTCCCATTGC GGCTGCGTTT AAAGAAGGTC GGGACTTTGT AACCCCAGTT
AAACCCAATA CTATTGCTAA ATCCATCGCT ATTGGTAATC CTGCTGATGG TTATTACGCC
TTAGATATTG CGCGTAAAAC CAACGGGAAT ATTGAAAGCG TCACCGATGC AGAGATCGTC
GAAGGGATTA AACTTTTAGC GGAAACTGAA GGCATTTTCA CGGAAACCGC AGGGGGAACT
ACCATTGCGG TCCTCAAAAA ACTGGTAGAA GCGGGTAAAA TTGATCCTGA AGAAACTACC
GTAGTTTATA TCACCGGAAA CGGATTAAAA ACCCAAGAAG CGGTGCAAGA GTACATCGGT
CAACCCCTAA TTATCGAGCC TAAATTAGAC AGTTTTGAAC GAGCTCTGGA ACGTTCTCGG
ACTCTAGAAC GTCTAGAATG GCAACAGGTT TTAGTTTAG
 
Protein sequence
MTQATQTQTK AATFTHLVSK EGGVKYPLKA LHVCEETFSP LEVAYDYDAI RAQVTRESIQ 
AGPNSIWRYK AFLPVESENP IDVGTGMTPL VKSHRLARRL GLKNLYIKND AVNMPTLSFK
DRVVSVALTR AKELGFTTVS CASTGNLANS TAAIAAHAGL DCCVFIPADL EAGKVLGTLI
YNPTVMAVKG NYDQVNRLCC EVGNSYGWGF VNINLRPYYS EGSKTLGFEV AEQLGWKLPD
HVVAPLASGS LYTKIYKGFQ EFIKTGLVED KAVRFSGAQA EGCSPIAAAF KEGRDFVTPV
KPNTIAKSIA IGNPADGYYA LDIARKTNGN IESVTDAEIV EGIKLLAETE GIFTETAGGT
TIAVLKKLVE AGKIDPEETT VVYITGNGLK TQEAVQEYIG QPLIIEPKLD SFERALERSR
TLERLEWQQV LV