Gene Synpcc7942_2175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_2175 
Symbol 
ID3773732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp2254747 
End bp2255847 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content53% 
IMG OID637800620 
Producttransport system substrate-binding protein 
Protein accessionYP_401192 
Protein GI81300984 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.377327 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCACA ACCTATCTCT ACCCTCCATG TCTGAATCAA TGTTCAGTCG TCGGGACTTT 
TTGTTGGGCG GGACAGCTCT CGCCGGAACG CTATTACTCG ATAGTTTTGG TGACTGGCGC
CGTCGGGCAG AAGCTGCTGA AGGTGAAGTC AATTTGTACT CGGGTCGGCA CTACAACACC
GACAATCAGA TCTATCGGGA ATTCACCCAA AAAACAGGGA TTAAAGTCAA TCTAATTGAA
GGTGAAGCCG ATGCACTTTT AGCCCGTCTC AAGAGTGAAG GCAGCCGCAG CCCGGCAGAT
GTTTTCATTA CGGTTGATGC GGGGCGCCTT TGGCAAGCGA CTCAAGCCAA CCTGCTCAGA
CCACTGACTC AAGCCCAAGC TCCGAAACTG TATCAAGCAG TTCCGGCGAA TCTGCGGGAT
CCCCAGGGAC GTTGGTTTGC CTTGTCCAAG CGGGCGCGAG TCATTATGTA CAACCGCGAT
CGCGTCAATG CTAGCCAGCT GTCTACCTAC GAGGATTTGG CCAATCCAAA ATGGCGCAAT
CAAATCCTCG TGCGCAGTTC CAGCAACGTC TATAACCTCT CCTTGACCGG TGAGATGATT
GCTGCGGATG GTGCGGCCAA AACTGAGGCT TGGGCGCGGG GGCTTGTCCA AAACTTTGCG
CGTCAGCCCC AAGGTGGAGA TACCCCGCAA ATTCTGGCTT GCGCCGCGGG TGTCGGCTCT
CTGGCAATTG CCAACACCTA CTATTTGGTG CGTCTCTTCA AGTCGAAAAA AGCAGAAGAG
CGGGAAGCTG CAAGAAAGAT TAAAGTCTTC TTCCCCAACC AAAAAGGACG GGGCACCCAC
GTCAACATCA GCGGTGCTGG CATCGTCCGC ACGGCCCCGA ATCCACGGGC TGCTCAACTG
TTACTGGAGT ACCTGCTCAG TAGCCAAGCC CAGGCTGTGT TCGCTAGAGG CAATGGTGAA
TATCCAGTCT TGCGTGGGGT CTCGCTCGAT CCGATTTTGG CAGGCTTTGG TCAATTCAAA
GAATCCAAAA TCAGTGCCTC AGTCTTCGGG GCCAATAATG CTCAAGCCCT GCAGTTGATG
GATCGTGCTG GCTGGAAATA A
 
Protein sequence
MHHNLSLPSM SESMFSRRDF LLGGTALAGT LLLDSFGDWR RRAEAAEGEV NLYSGRHYNT 
DNQIYREFTQ KTGIKVNLIE GEADALLARL KSEGSRSPAD VFITVDAGRL WQATQANLLR
PLTQAQAPKL YQAVPANLRD PQGRWFALSK RARVIMYNRD RVNASQLSTY EDLANPKWRN
QILVRSSSNV YNLSLTGEMI AADGAAKTEA WARGLVQNFA RQPQGGDTPQ ILACAAGVGS
LAIANTYYLV RLFKSKKAEE REAARKIKVF FPNQKGRGTH VNISGAGIVR TAPNPRAAQL
LLEYLLSSQA QAVFARGNGE YPVLRGVSLD PILAGFGQFK ESKISASVFG ANNAQALQLM
DRAGWK