Gene Synpcc7942_2107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_2107 
Symbol 
ID3774326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp2186865 
End bp2188454 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content50% 
IMG OID637800552 
ProductABC-type nitrate/sulfonate/bicarbonate transport systems periplasmic components-like 
Protein accessionYP_401124 
Protein GI81300916 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.287431 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCCCC CAACACTTCT TGAGGAGCAA TCTAAGCTCC TCCATTCTGC TGCGTGTCGC 
TGTCAGGATT GCTATCGTCT GACGACCGAT CACGATCGCT TTTTGGAAGA TATGCCGCAA
GACCCTGAGA TCTTGATGGC AGATTTTCAA AAGATGGGTT TATTCAAACC TGAATCGATT
GCGATCGCCG ATCGTCTGAC GACTTCAGAG TTGCGCCAAG CCTTGTTCTT TAAAAATGCT
TCCCAAGGGG ATCCTGAACA GGAAGCAATG CTGAGAGCTT TGGCTGCAGA AGCAGGTGGT
TTAGATCAGG CATTTGCTGC TGCTTTTGGA CCGCAAGCGG GTCGTTTTTT CAGCAACATT
CAAGCTAGTG GCGGGGTCAG TCGTCGGGTT TTCCTGCGCA ACTTGGTCGT TGGTGCAGCT
CTCGTGACGC TGACCAACTG TGCCCAACAG GCTCAACAAC CCGATAGCCC AACCACGACT
GGCAGTGGCA ATTTAGAAAA AACGGATCTG AAGGTTGGCT TTATCCCGAT TACCTGCGCC
ACTCCGATCA TCATGTCAGA TCCCTTGGGC TTCTATCAGA AATATGGCTT GAAAGTCCAA
GTTGTGAAGA TGCCGAGCTG GGGGGCAGTT CGAGATTCTG CGATCGCAGG CGAATTGGAT
GCCTATCACA TGCTGGCACC GATGCCGATC GCGATGACCT TGGGTCTTGG CTCAGCTCCC
TTCAGTGTCA AGTTAGCCAG TATTGAAAAT ATTAACGGTC AGGCGATTAC GGTTGCCAAA
CGTCACCTTG GCAAAGTCAA AGAAGCGAAA GACTTCAAAG GCTTTGTGAT TGGGGTCCCC
TTCCCCTTCT CAATGCATAA CCTGCTGTTG CGCTACTATC TCGCTGCTGG TGGTTTGAAT
CCCGATACCG ATGTCCAAAT TCGGCCAGTT CCCCCGCCAG ATAGTATTGC TCAGCTCGTC
GCAGGTGATA TCGATGCGAT GCTGATGCCC GATCCCTTTA ATCAGCGGGC AGTGTATGAA
GATGCTGGCT TTATTCATCT GTTAACTAAA GAAATTTGGA ATGGTCATCC TTGCTGTGCA
TTTGCAGCAG GTGAGCCTTG GATTCAAGAA AATCCCAATA CGTTCCGAGC GCTTAACAAA
GCAATTATTG AAGCAACTGG TTATGCCAGT AAGGCCGAAA ATCGTGCTGA GATTGCCAAG
GCTATTTCTA GCCGTCAGTA CTTAAATCAA CCACCCGAAG TCGTGGAAGC TGTGCTGACC
GGTAAGTTCC CCAATGGTCA AGGTCAAGAA CTGGATGTTC CCGATCGCAT TGACTTCAAT
CCCTACCCAT GGCAGAGCTT TGCCAACTGG ATTCAATCGC AGCTAGTGCG TTGGGATCTG
GGTAAAGCTG CCGGTGTGAT CCAGCCCGAT CAGTACGACA AGAACGGTCA GGCAATTTAC
CTGACGACTG AAGCACAAAC CCTCGAGAAG GAAGTGGGCC TGCAGCCGCC GACTGAAATC
TATCGGGAAG AAAAGCTCGC TTACGACACC TTTAACCCGC AGGATCCAGT CGCTTACCTC
GCATCTCAAA AGCAGAAATA CGGGAGATAA
 
Protein sequence
MVPPTLLEEQ SKLLHSAACR CQDCYRLTTD HDRFLEDMPQ DPEILMADFQ KMGLFKPESI 
AIADRLTTSE LRQALFFKNA SQGDPEQEAM LRALAAEAGG LDQAFAAAFG PQAGRFFSNI
QASGGVSRRV FLRNLVVGAA LVTLTNCAQQ AQQPDSPTTT GSGNLEKTDL KVGFIPITCA
TPIIMSDPLG FYQKYGLKVQ VVKMPSWGAV RDSAIAGELD AYHMLAPMPI AMTLGLGSAP
FSVKLASIEN INGQAITVAK RHLGKVKEAK DFKGFVIGVP FPFSMHNLLL RYYLAAGGLN
PDTDVQIRPV PPPDSIAQLV AGDIDAMLMP DPFNQRAVYE DAGFIHLLTK EIWNGHPCCA
FAAGEPWIQE NPNTFRALNK AIIEATGYAS KAENRAEIAK AISSRQYLNQ PPEVVEAVLT
GKFPNGQGQE LDVPDRIDFN PYPWQSFANW IQSQLVRWDL GKAAGVIQPD QYDKNGQAIY
LTTEAQTLEK EVGLQPPTEI YREEKLAYDT FNPQDPVAYL ASQKQKYGR