Gene Syncc9902_2104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_2104 
Symbol 
ID3742106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp2007592 
End bp2008767 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content63% 
IMG OID637772301 
ProductN-acetylglucosamine-6-phosphate deacetylase 
Protein accessionYP_378105 
Protein GI78185671 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1820] N-acetylglucosamine-6-phosphate deacetylase 
TIGRFAM ID[TIGR00221] N-acetylglucosamine-6-phosphate deacetylase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACGGA TCACCAATGT GCGCCTGCCC GCACCCTTGG GCGGTGCAAC CAAACAACGC 
CATTGGCTGA GCCTCAGCGT TGATGACACG ATCACTGGAA TCGGCCGGAT GGGACCGAAC
GACAGCCCAA GCGACAAGGC ACGTCCTGAC ACGGACGCCG AAAACTGGAA CGGCGACTGG
ATTAGCCCTC GCGCAATCGA TCTGCAAATC AATGGCGGCT TGGGCTTGGC CTTCCCCGAA
CTGACACCCA GCGACCTACC GCGACTCGTG GAACTGCTGG ATTTGCTATG GAGCGATGGC
GTCGAAGCGA TTGCGCCAAC GTTGGTGACG TGCGGAATTG AGCCCCTTCG TAATGCCCTG
GCCGTTTTAC GAGAGGCCAG GACGCAGCAT TGCGCTGGCC GCTGTCAGCT TTTGGGGGCC
CACCTTGAAG GTCCATTTCT GGCGGAATCA CGCCGGGGCG CCCATCCCCG GGAGCACCTG
GCCAGCCCAA CCCTGACGGC CCTGGAGGCA CGCATCAACG GCTTCGAATC CGAGATTGCC
CTGATGACCC TGGCCCCAGA ACTGGTGGGG GCCGATGCAG TGATTCAGCG GCTGAAGGAT
TTGGGAATCA TGGTGGCCCT CGGGCACAGC GCCGCCAACG CCAACACCGC TGGCCAAGCG
TTCGAGAGAG GGGTGGGCAT GCTGACCCAC GCCTTCAATG CCATGCCAGG ACTCCATCAC
CGCGCGCCAG GACCGATCGG CGAAGCCTGT CGAAACGGCC ACATCGCCCT AGGACTGATT
GCTGATGGCG TGCATGTCGA CCCCACCATG GCGGTGTTGC TTCAACGGCT GGCCCCAAGC
CAAATCGTGT TGGTGAGCGA TGCCCTAGCG CCCTACGGGC TCGCCGATGG CACACACCGC
TGGGATGAAC GAACCCTGCT GGTGAAGAAC GGCACCTGCC GCCTCGAGGA CGGAACCCTG
GCGGGGGTGA CCTTGCCACA ACTCGAGGGA GTGAAGCGCC TTGCCCGCTG GAGCAAGGAC
GGATCAGCCG CCATCTGGAG TGCCACGGTG GCCCCCCGCG GCCTCATCAA CGGCCCTAAC
GGGGTCGTCG ACGCTTTGAT CGGGAAACCC CTTTCAGCAC TTCTGCGCTG GCATCAACCC
GAGCAAGGAG ACCTTGACTG GACCTGCGCT GCTTAG
 
Protein sequence
MRRITNVRLP APLGGATKQR HWLSLSVDDT ITGIGRMGPN DSPSDKARPD TDAENWNGDW 
ISPRAIDLQI NGGLGLAFPE LTPSDLPRLV ELLDLLWSDG VEAIAPTLVT CGIEPLRNAL
AVLREARTQH CAGRCQLLGA HLEGPFLAES RRGAHPREHL ASPTLTALEA RINGFESEIA
LMTLAPELVG ADAVIQRLKD LGIMVALGHS AANANTAGQA FERGVGMLTH AFNAMPGLHH
RAPGPIGEAC RNGHIALGLI ADGVHVDPTM AVLLQRLAPS QIVLVSDALA PYGLADGTHR
WDERTLLVKN GTCRLEDGTL AGVTLPQLEG VKRLARWSKD GSAAIWSATV APRGLINGPN
GVVDALIGKP LSALLRWHQP EQGDLDWTCA A