Gene Syncc9605_2035 details

Gene Information

Locus tag: Syncc9605_2035
Symbol:
ID: 3737711
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus sp. CC9605
Kingdom: Bacteria
Replicon accession: NC_007516
Strand:
Start bp: 1853214
End bp: 1854350
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 59%
IMG OID: 637776621
Product: dTDP-glucose 4,6-dehydratase
Protein accession: YP_382330
Protein GI: 78213551
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1088] dTDP-D-glucose 4,6-dehydratase
TIGRFAM ID: [TIGR01181] dTDP-glucose 4,6-dehydratase
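
The coordinates and lengths listed above are mutually consistent, which a short Python sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1853214, 1854350)
print(length)                  # 1137
print(protein_length(length))  # 378
```

Both values match the Gene Length and Protein Length fields of this record.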


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTTCAT CGATGCCTTC CGCCTCCGAC CTGCTGGGAA ATCGCCGCAG AGTTCTGGTC 
ACCGGAGGTG CCGGCTTCAT TGGCGGGGCC GTCGTGCGCA GGCTGCTGCG GGAGACCACA
GTCACCGTCT TCAATCTCGA CAAGATGGGC TATGCCAGCG ACCTGTCCTC GATTGAGAAG
GTGCTGAGCG AACTGGGCGA AGCGGCCAAC GATCGGCACA GGCTTCAGCA AGTCGATCTC
ACCGATGCAA CAGCCGTGGA GGCTGCGGTG CAGGAGGCCG ACCCCGATCT TGTGATGCAC
CTGGCAGCGG AAAGCCACGT GGATCGATCC ATCTCCGGCC CTGGTGTCTT TATCGAGAGC
AACGTCAACG GGACCTACAA CCTCCTACAG GCGGTGCGAA GCCACTACGA GGGCTTGAGC
GGTGAACGCC GTGATTCCTT CCGGATGCAC CACATCAGCA CCGACGAAGT CTTTGGATCC
CTGGGCGCCG AGGGGCGCTT CTCAGAAACA ACGCCCTACG ACCCCCGCAG CCCCTACTCC
GCCAGCAAGG CGGCAAGCGA TCACCTGGTT CAGGCCTGGC ACCACACCTT TGGGCTTCCC
GTGGTGCTCA CCAACTGCTC AAACAACTAT GGCCCCTGGC AGTTCCCGGA AAAACTCATT
CCCGTTGTCA CCTTGAAGGC CGCTGGATGT GAGTCAATTC CTCTTTATGG CGATGGGCTG
AATGTGCGGG ATTGGTTGTA CGTGGAAGAT CATGTCGACG CGCTGCTTTT GGCCGCCTGC
AAGGGAGAGT CGGGACACAG TTATTGCGTC GGCGGCCACG GTGAACGCAC CAATAAAGAG
GTCGTTAACG CCATCTGCCA ACAGATGGAT CAAAGCCGTC CCACATCAGC TCCCCACGCA
GATTTGATTA CGCCGGTGAC CGATCGACCA GGCCATGACC GCCGCTACGC AATCGACCCA
AGCCGCATCA GTGCAGAGCT GGGCTGGAGC CCTCGTCACG ATGTTGAGCA AGGACTCGCC
GAGACAGTGA ACTGGTATTT GGCCAATCAG GACTGGTGCA ACAAGGTGCG TCAGCGTGCG
GGATATGACG GCAGCAGATT GGGCATGAGG ACACCAAAAA CCAACTCAAA CGAGTGA
 
Protein sequence
MVSSMPSASD LLGNRRRVLV TGGAGFIGGA VVRRLLRETT VTVFNLDKMG YASDLSSIEK 
VLSELGEAAN DRHRLQQVDL TDATAVEAAV QEADPDLVMH LAAESHVDRS ISGPGVFIES
NVNGTYNLLQ AVRSHYEGLS GERRDSFRMH HISTDEVFGS LGAEGRFSET TPYDPRSPYS
ASKAASDHLV QAWHHTFGLP VVLTNCSNNY GPWQFPEKLI PVVTLKAAGC ESIPLYGDGL
NVRDWLYVED HVDALLLAAC KGESGHSYCV GGHGERTNKE VVNAICQQMD QSRPTSAPHA
DLITPVTDRP GHDRRYAIDP SRISAELGWS PRHDVEQGLA ETVNWYLANQ DWCNKVRQRA
GYDGSRLGMR TPKTNSNE
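
The relationship between the gene and protein sequences above can be sketched in Python: strip the spacing between the 10-character blocks, compute the GC fraction, and translate codon by codon until a stop codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which start codons are permitted), so the standard codon table is used here. The helper names `clean`, `gc_content`, and `translate` are illustrative assumptions, not part of any existing tool.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI "TCAG" order.
# Table 11 shares these assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence above, plus one base to complete a codon.
prefix = "ATGGTTTCAT CGATGCCTTC C"
print(translate(prefix))  # MVSSMPS
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the listed protein sequence and the GC content field of this record.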