Gene Syncc9902_0100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_0100 
Symbol 
ID3744053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp99650 
End bp101056 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content32% 
IMG OID637770266 
ProductHAD family hydrolase 
Protein accessionYP_376118 
Protein GI78183684 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG0546] Predicted phosphatases
[COG1083] CMP-N-acetylneuraminic acid synthetase 
TIGRFAM ID[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0368988 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATCATGA TTTCACACCA ATATAAAAAT ACTGACAAAG AAATAATGTA CAAGAAAATC 
TCAGCGATTG TCCCGATTAA GTTTAATTCG AGAAGACTTC CGAACAAAAA TTTCCTGAAT
CTTAATGGAA GGCCACTATG CTCATATATA TTTCAAACAT TAACACAAGT CGAAGGAATT
CATAACATTT ATTGTTATGC AAGTTCAAGT CTCCCACTAA ATTTTTTACC AAAATCAGTC
AAGTACCTTC AGAGACCTTC GTACCTAGAT GGGGATAATA TAGAGGCAAC TGAACTATTT
AGATACGCAA TAGAATCAAT AGATACGGAT ATTGTGATAA TTACACATGC TACATCGCCA
TTGATACATT CAGAATCAAT AGAAAGAGGA TTACAAGCAG TAATCTCAGG GGAATATAGA
TCGGCATATA GTGTACATAA AATACAAAAG TATTCGTGGT GTGATGGAAA ACCAGTCAAT
TTTACACCTT CTAAGCTTGA ACAAACTCAA AAAATATCAC CAGTATTGTA TGAAACATCA
GGATTTTATG TTTTTAGGAA AAGAGATTTT TTGGAAAGTA ATACAAGGAC TACTGAACCT
GCATTCAAAG TAGAGATACC CATATCAGAA GCAGTAGATA TTGACAATCC TGAAGACTTT
GAGCTAGCGA CAAAGTTGCA ATATGATAAT AATATAACAG AAAATGAGCT GACTTCGAAA
TACTTTGTTG ATCTAATCAA AAGAATAAGT CCAGAATCAA ATTTGAAAGA AAGCATTAGT
CATATATGCT TTGATTTAGA TGGTGTTTTG ATAGATTCAA AGATTGTGAT GAAGATAGCA
TGGGAGGAGT GTATGAGAAT ATTTGAATTG GAACAAACAT TCGAAGAATA TTTCACACAT
TTAGGAATCG AATTTTTTGA AATTTTGAAA AAAATTGAAA TTAAAAAAAA TTTGCATGAT
GATATCTATA AACTTTATAA TGAGGTATCT CTAAATAATT CATATAAAAT TAAAGTCTAT
AATAACACAA GAGAGGTATT AAAAAGATTA AAAAATGCAG GCTTCGGATT AAGTATATGT
ACATCCAAAT CACGAAAACG TACTATTGAG GTACTTAGAA GGAATAACTT GATTCAGTAT
TTTGACTATA TTTCTGCAAC AGAAAAAGAA AATAAACTTA GACCAAAACC AGCACCAGAC
TTTCTATTAG AGTGTTGTAC TAACTTAAAA GTAGACCCAT CGCAGACAAT CTATATAGGT
GACACAGATT ACGATCATGA ATGCGCTAAG AGAGCACATA CTGCATTTAT ACATGCAGAA
TGGGGATATG GGAGAGACAA TATTAGGGAA AAGTCAATAT GGTTAGAGTC CATTAAAGAC
CTTGACACCC TCTTGATCGA TGAGTAA
 
Protein sequence
MIMISHQYKN TDKEIMYKKI SAIVPIKFNS RRLPNKNFLN LNGRPLCSYI FQTLTQVEGI 
HNIYCYASSS LPLNFLPKSV KYLQRPSYLD GDNIEATELF RYAIESIDTD IVIITHATSP
LIHSESIERG LQAVISGEYR SAYSVHKIQK YSWCDGKPVN FTPSKLEQTQ KISPVLYETS
GFYVFRKRDF LESNTRTTEP AFKVEIPISE AVDIDNPEDF ELATKLQYDN NITENELTSK
YFVDLIKRIS PESNLKESIS HICFDLDGVL IDSKIVMKIA WEECMRIFEL EQTFEEYFTH
LGIEFFEILK KIEIKKNLHD DIYKLYNEVS LNNSYKIKVY NNTREVLKRL KNAGFGLSIC
TSKSRKRTIE VLRRNNLIQY FDYISATEKE NKLRPKPAPD FLLECCTNLK VDPSQTIYIG
DTDYDHECAK RAHTAFIHAE WGYGRDNIRE KSIWLESIKD LDTLLIDE