Gene Synpcc7942_1237 details


Gene Information

Locus tag: Synpcc7942_1237
Symbol:
ID: 3773525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 1258978
End bp: 1260957
Gene Length: 1980 bp
Protein Length: 659 aa
Translation table: 11
GC content: 59%
IMG OID: 637799666
Product: nitrate transport ATP-binding subunits C and D
Protein accession: YP_400254
Protein GI: 81300046
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1116] ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component
TIGRFAM ID: [TIGR01184] nitrate transport ATP-binding subunits C and D
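
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with one another. A minimal Python sketch of that arithmetic follows; the function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon, neither of which this record states explicitly.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    return length_bp // 3 - 1


# Coordinates copied from the Gene Information fields.
length = gene_length_bp(1258978, 1260957)
print(length)                     # 1980
print(protein_length_aa(length))  # 659
```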


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGTTT TCTTAGCGGT TGACCACGTT CATCAGGTCT TTGACCTACC GGGTGGCGGA 
CAATATATCG CCCTCAAAGA TGTCAGTTTG AATATTCGCC CCGGCGAATT TATCTCCTTG
ATTGGCCACT CCGGCTGTGG TAAATCGACG CTGCTCAACT TGATTGCTGG CCTTGCCCAA
CCCAGCAGCG GCGGCATCAT TCTTGAAGGT CGGCAGGTCA CGGAACCTGG CCCCGATCGC
ATGGTGGTTT TCCAGAACTA TTCGCTCTTA CCGTGGCGAA CTGTCCGCCA GAATATCGCC
CTTGCCGTCG ATAGTGTTCT GCACGATCGC AATCGCACCG AGCGCCGCAC GATTATCGAA
GAAACCATTG ATCTTGTTGG GTTGCGGGCC GCAGCCGACA AATATCCCCA CGAGATTTCC
GGCGGGATGA AACAGCGGGT CGCGATCGCA CGCGGCTTGG CGATTCGGCC CAAACTGCTA
CTCCTAGATG AACCCTTTGG TGCCTTGGAT GCCCTGACCC GTGGCAACCT GCAAGAGCAA
CTGATGCGGA TTTGCCAAGA GGCTGGTGTG ACGGCGGTGA TGGTCACCCA TGATGTCGAC
GAGGCATTGT TGCTCTCCGA TCGCGTCGTG ATGCTAACGA ACGGTCCCGC CGCCCAAATC
GGTCAGATTC TGGAAGTTGA TTTCCCCCGT CCCCGCCAAC GGCTGGAGAT GATGGAAACG
CCTCACTACT ACGATCTGCG CAACGAGCTG ATCAACTTCC TACAGCAACA GCGACGGGCC
AAGCGGCGAG CCAAAGCTGC AGCACCAGCT CCGGCTGTGG CAGCGTCGCA GCAGAAAACT
GTCCGACTGG GCTTTTTGCC CGGCAACGAC TGCGCTCCCT TGGCGATCGC TCAAGAATTG
GGTCTGTTCC AAGACCTCGG TCTGTCCGTG GAACTGCAGT CCTTCCTGAC CTGGGAGGCA
CTGGAAGACA GCATCCGGCT GGGGCAACTG GAGGGCGCCT TGATGATGGC GGCCCAGCCC
CTCGCCATGA CCATGGGTCT GGGGGGACAC CGGCCTTTCG CGATCGCAAC ACCCCTGACG
GTCAGCCGCA ACGGTGGGGC GATCGCCCTC TCCCGTCGCT ACCTCAACGC CGGTGTACGC
AGCCTCGAAG ATCTCTGCCA GTTCTTAGCT GCAACTCCCC AGCGATTGCG CCTCGCCATT
CCTGATCCAA TTGCCATGCC AGCCCTGCTG CTGCGCTATT GGTTGGCCAG CGCCGGCCTC
AACCCCGAGC AAGACGTTGA GCTGGTGGGA ATGTCGCCCT ACGAAATGGT GGAGGCACTC
AAAGCCGGTG ACATCGATGG CTTTGCGGCG GGTGAAATGC GGATTGCTCT CGCCGTCCAA
GCAGGAGCCG CCTACGTCCT AGCAACCGAT TTGGACATCT GGGCGGGACA TCCTGAGAAG
GTTTTGGGGC TGCCAGAAGC GTGGCTACAA GTGAATCCTG AAACCGCGAT CGCCCTTTGC
TCGGCCCTGC TCAAAGCCGG CGAACTCTGC GACGATCCTC GTCAGCGCGA TCGCATTGTC
GAGGTCTTGC AACAACCGCA ATATCTCGGG TCTGCTGCCG GCACGGTGCT ACAGCGCTAC
TTTGACTTTG GCTTGGGCGA TGAACCCACC CAAATTCTGC GCTTCAACCA ATTCCACGTC
GATCAGGCCA ACTACCCCAA TCCGCTCGAG GGCACTTGGC TGCTGACTCA GCTCTGCCGC
TGGGGTCTGA CGCCCTTACC CAAAAACCGG CAGGAATTGC TCGATCGCGT CTATCGCCGC
GATATCTACG AAGCCGCGAT CGCTGCCGTG GGCTTCCCGC TCATCACTCC CAGTCAGCGA
GGCTTCGAAC TCTTCGATGC GGTGCCCTTC GACCCCGATA GCCCGCTGCG CTACCTCGAA
CAATTCGAGA TCAAAGCGCC GATTCAGGTT GCTCCCATTC CGCTTGCTAC CTCTGCCTAG
 
Protein sequence
MSVFLAVDHV HQVFDLPGGG QYIALKDVSL NIRPGEFISL IGHSGCGKST LLNLIAGLAQ 
PSSGGIILEG RQVTEPGPDR MVVFQNYSLL PWRTVRQNIA LAVDSVLHDR NRTERRTIIE
ETIDLVGLRA AADKYPHEIS GGMKQRVAIA RGLAIRPKLL LLDEPFGALD ALTRGNLQEQ
LMRICQEAGV TAVMVTHDVD EALLLSDRVV MLTNGPAAQI GQILEVDFPR PRQRLEMMET
PHYYDLRNEL INFLQQQRRA KRRAKAAAPA PAVAASQQKT VRLGFLPGND CAPLAIAQEL
GLFQDLGLSV ELQSFLTWEA LEDSIRLGQL EGALMMAAQP LAMTMGLGGH RPFAIATPLT
VSRNGGAIAL SRRYLNAGVR SLEDLCQFLA ATPQRLRLAI PDPIAMPALL LRYWLASAGL
NPEQDVELVG MSPYEMVEAL KAGDIDGFAA GEMRIALAVQ AGAAYVLATD LDIWAGHPEK
VLGLPEAWLQ VNPETAIALC SALLKAGELC DDPRQRDRIV EVLQQPQYLG SAAGTVLQRY
FDFGLGDEPT QILRFNQFHV DQANYPNPLE GTWLLTQLCR WGLTPLPKNR QELLDRVYRR
DIYEAAIAAV GFPLITPSQR GFELFDAVPF DPDSPLRYLE QFEIKAPIQV APIPLATSA
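
The sequences above are printed in space-separated blocks of ten characters. As a rough illustration of how such a listing could be checked against the record, the Python sketch below strips the block spacing, computes a GC fraction, and translates codons. All names here are hypothetical. The codon-to-amino-acid assignments are those of translation table 11, which for elongation codons match the standard genetic code; alternative start codons, which table 11 permits, are not handled, on the assumption that the gene starts with ATG as this one does.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    """
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listing.
first_blocks = clean_sequence("ATGTCTGTTT TCTTAGCGGT TGACCACGTT")
print(translate(first_blocks))  # MSVFLAVDHV
print(f"{gc_fraction(first_blocks):.2f}")
```

Pasting the full gene listing into `clean_sequence` and passing the result to `translate` and `gc_fraction` would allow a comparison with the Protein sequence listing and the GC content field.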