Gene Syncc9605_1079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_1079 
Symbol 
ID3737791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp1018174 
End bp1019175 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content57% 
IMG OID637775670 
ProductABC transporter, substrate-binding protein, aliphatic sulphonates 
Protein accessionYP_381391 
Protein GI78212612 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0298231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAAC AACTACGCAA CCTACTTTTC GCCGGCCTCG CCATCGTGCT GGCTGTCGCT 
TGCTCCAAGC CCTCCACACC GACGGTGGGC GGCACACCAA TTGTTTTGGG CTACAGCAAC
TGGGCCGGAT GGTGGCCCTG GGCGATTGCC GTGGAGGAAA AGCTGTTCGA GAAGAACGGC
GTAAATGTGG AGATGAAGTG GTTCGACGGC TACGTGCAGT CGATGGAAAC CTTCGCCGCT
GGCAAGATCG ACGGCAACTC CCAAACCTTG AACGACACCA TTTCCTTCCT GCCGGGTGAA
AACGGCGGTG AAGTGGTGGT TTTGGTGAAC GACAATTCTG CTGGCAATGA CCAGATCATT
GCCGATGCCT CCATCACATC CATCACCGAT CTCAAAGGCA AGACCGTTGC TGTTGAGGAA
GGCGTTGTGG ATGACTACCT GCTCAGCCTG GCCCTCAAGG ATGCGGGCCT GAGCCGCGAC
GACGTGGTGA TCAAAGGCAT GCCCACCGAT CAGGCAGCCA CTGCATTCGC GGCAGGTCAG
GTTGATGCCG TTGGTGCCTT CCCTCCCTAC ACAGGTACCG CCATGCAGCG AGAAGGTGCG
CAGGTGATCG CCAGTTCCAA GGAGTATCCC GGTGCCATTC CTGATCTGCT CACCGTCAGC
GGTGATCTGA TTAAGGAACG TCCCGACGAT GTGCAGAAGA TCGTGAAGAC CTGGTGGGAC
GTTCGCGAGT TCATGGAAAA GAACCGCGAA AAATCCGAGG CGATCATGGC CAAGCGTGCC
GGCATTCCCA CGGAGGAATA CGAGCAGTAC AAAGACGGCA CCCGCTTCTT CTCCATCGAG
GAAAACCTCG AGGCCTTCAG CGCTGGTGAG GGAATGAAGT TCATGCCGTT CGCTGCTGAG
TCGATGGCCG ACTTCATGGT TTCGGTGGGC TTCATCCCTG AGAAACCAGA CATGAGCAAG
CTGTTTGACG ACAGCTTCAT CAAGAAGGTC GCCGCCTCCT GA
 
Protein sequence
MTKQLRNLLF AGLAIVLAVA CSKPSTPTVG GTPIVLGYSN WAGWWPWAIA VEEKLFEKNG 
VNVEMKWFDG YVQSMETFAA GKIDGNSQTL NDTISFLPGE NGGEVVVLVN DNSAGNDQII
ADASITSITD LKGKTVAVEE GVVDDYLLSL ALKDAGLSRD DVVIKGMPTD QAATAFAAGQ
VDAVGAFPPY TGTAMQREGA QVIASSKEYP GAIPDLLTVS GDLIKERPDD VQKIVKTWWD
VREFMEKNRE KSEAIMAKRA GIPTEEYEQY KDGTRFFSIE ENLEAFSAGE GMKFMPFAAE
SMADFMVSVG FIPEKPDMSK LFDDSFIKKV AAS