Gene Syncc9902_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_0101 
Symbol 
ID3744054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp101031 
End bp102050 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content36% 
IMG OID637770267 
Productcapsular polysaccharide biosynthesis protein 
Protein accessionYP_376119 
Protein GI78183685 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID[TIGR03589] UDP-N-acetylglucosamine 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0202415 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACATAG GTATGAAAAA AATCCTTTTA ACTGGCGGAA CCGGAAGTTT TGGGAAAGCT 
TTTATTAAAG AAACAATAAC AAAATATCAA GATGTAGAAA GATTAGTCAT TTATAGTAGA
GATGAACTGA AACAATGGGA ACTACAGCAA ATCTATCCAG AGAAACAATA TCCACAAATT
AGATTTTTTT TGGGAGATGT CAGAGACGAA AACAGATTAC GAAGAGCACT AGAAGGGATA
GATACTGTAG TTCATGCAGC AGCATTAAAG CAAGTGCCAG CAGCAGAATA TAATCCTTTC
GAATTTGTGA AAACTAACAT AATTGGTGCA AACAACTTAA TTCAAGCATG TCTAGATACT
GAAGTATCAA ACATTGTAGC CCTAAGCACT GACAAGGCTG CTGCACCAAT TAACTTATAT
GGAGCCACAA AACTGTGCTC AGACAAGTTA TTCATCGCAG CAAACAATGT CAGAGGCGGA
AAAAATACAA AATTCTCAGT AGTAAGATAT GGGAACGTAA TGGGGTCAAG AGGTTCGGTG
ATACCATATT TTTTAAAAGA AGCCAAAAAT TCAGGAAAAC TAAACATAAC TGACACCAGG
ATGACCAGGT TCAACATAGT GCTAAAGGAA GGCGTAGAGA TGGTACATTG GGCAATAAAG
CAAAGCATGG GTGGGGAAAT ATTTGTGCCT AAGATACCAA GTTATCGTAT TGTTGATGTT
GCTGAAGCTA TTGCACCTTC GTTGAACCAC GAAGTAATAG GAATACGTCC AGGAGAGAAA
ATTCACGAAG AAATGATAAC AACATCAGAC AGCACAACGA CACTTGACTT AGGTAAATAT
TATGCAATTA CGCCTGCTGG AGGTGGAGTA ATTGAAAAAT ATAAAAAAGA AGATAGGCCT
TATGAAAGAG TAAAAGAAGG ATTTACATAC AATTCATTAG ATAATAAACA ATATCTCAAT
ATAAGTGAAA TAAGAGCCCT AATCAGAAGT AATATTGATC ATGATTTCAC ACCAATATAA
 
Protein sequence
MNIGMKKILL TGGTGSFGKA FIKETITKYQ DVERLVIYSR DELKQWELQQ IYPEKQYPQI 
RFFLGDVRDE NRLRRALEGI DTVVHAAALK QVPAAEYNPF EFVKTNIIGA NNLIQACLDT
EVSNIVALST DKAAAPINLY GATKLCSDKL FIAANNVRGG KNTKFSVVRY GNVMGSRGSV
IPYFLKEAKN SGKLNITDTR MTRFNIVLKE GVEMVHWAIK QSMGGEIFVP KIPSYRIVDV
AEAIAPSLNH EVIGIRPGEK IHEEMITTSD STTTLDLGKY YAITPAGGGV IEKYKKEDRP
YERVKEGFTY NSLDNKQYLN ISEIRALIRS NIDHDFTPI