Gene Syncc9902_1078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_1078 
Symbol 
ID3742462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp1038456 
End bp1039757 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content55% 
IMG OID637771254 
ProductABC transporter, likely sugar solute binding protein 
Protein accessionYP_377086 
Protein GI78184651 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACGAG GCTTCCGGAT TGCCAGCGGC TTGATCGGTC TGCTGTTTGT TCTGGCTTGT 
CTGGCATGGT CAGGCCAGAA ACAGCCTGTA CCCGTCTCAA TTTTGATGCC TGCGCCCTAT
GCGGATGCAA GTACTCAAAT GGTCGAAGCC TTCAATCGTG AGCACCGCGG ATCAATTCAC
CTTGAGGTCA TCCGTGGACC CCTGGAAACC GAGTCGATTT CTGATCTCGC AATTAGCAGT
TTGTTGCTGG GGGACACCCC CTTTGATGCC CTGTTGATGG ATGTCACCTG GCTGCCGAAA
TATGCCGAAG CAGGTTGGCT CGAACCCCTT GATCCCTGGT TCGATCAAGC CGCTCTTGAT
CAATTGATCT TGGGCGCTCG CCTTGGCAAT CACCACAAAG GTCAGCTCTA TCGATGGCCC
CTTGGCGCTG ATGTTGGCGT TTTGTATTGG CGGACCGACC TGATGGACTC GCCTCCCTCC
ACTCCAGAGC AACTGTCCGA CATCGCCATC AATTTGGTCC AAACCAAGCG CGTCACCAAT
GGCTATGTCT GGCAAGGAAG GCAATACGAA GGATTGAGTT GCAACTTCGT TGAAGTGCTC
CATGGCTTCG GAGGCCAGTG GCTTAACCCA GCCACCGGTC AACTCGAACT CGACGCAACG
TCGGCAGCTC GTGCAGCCGC CTGGATGCAA TCACTGATCA CAACTGGAGC CAGTCCAAAA
GCGGTAATCA ATTATTCGGA ATCAGAATCG CTGCAGGCAT TTAAAGCCGG GGATGCAGCA
TTCATGCGCA ACTGGCCCTA TGCCTGGGCA GAGCTCCAAA AACCCGAAAG CAACGTGCGG
GGAAACGTTG GAATTGCACT GATGGTTGCG GAACCCGATC AAAGTCCTGC CGCCACCGTT
GGCAGCTGGG GTTTGAGCTT GCTCAAACAG TCACCGCATC AAGAGGCCGC TGTTGAAGCG
ATTCGATATC TCACCTCTGA AGCTGCGCAG CGTGAACGGT TCCTCAACCA GGGCTACACC
CCCACCAGCA AAAGCTTGTT TAGGGATCCA GAGCTCATCG CGGTGTCGTC GGCCCTTCCT
GAAATCGCGA AGGCTTTGGA GTACTCCGTT TCAAGGCCAC CATCACCGCT TTATGCCCAG
CTCAGTGATC TTCTTCAACG CCAACTGAAT GGCCTGCTAA CTGCCGAACC AGAGAAGACC
AATCGCGATG TCGGCACGGC AACTGCCGCC TCAGCCGCTG CCATGAACAG AGTTCAAACC
AAAAGCGACA TGCTGCTTAA AGCGACAGGG GCAGAAGCAT GA
 
Protein sequence
MRRGFRIASG LIGLLFVLAC LAWSGQKQPV PVSILMPAPY ADASTQMVEA FNREHRGSIH 
LEVIRGPLET ESISDLAISS LLLGDTPFDA LLMDVTWLPK YAEAGWLEPL DPWFDQAALD
QLILGARLGN HHKGQLYRWP LGADVGVLYW RTDLMDSPPS TPEQLSDIAI NLVQTKRVTN
GYVWQGRQYE GLSCNFVEVL HGFGGQWLNP ATGQLELDAT SAARAAAWMQ SLITTGASPK
AVINYSESES LQAFKAGDAA FMRNWPYAWA ELQKPESNVR GNVGIALMVA EPDQSPAATV
GSWGLSLLKQ SPHQEAAVEA IRYLTSEAAQ RERFLNQGYT PTSKSLFRDP ELIAVSSALP
EIAKALEYSV SRPPSPLYAQ LSDLLQRQLN GLLTAEPEKT NRDVGTATAA SAAAMNRVQT
KSDMLLKATG AEA