Gene Synpcc7942_1572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1572 
Symbol 
ID3774996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1631339 
End bp1632925 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content55% 
IMG OID637800005 
Productdehydrogenase subunit-like protein 
Protein accessionYP_400589 
Protein GI81300381 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.849895 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.707104 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATCA ATCAAACTGA CTACGACATT GTCATCATTG GCACTGGGGC TGGAGGAGGA 
ACGCTACTCC ATCGCCTTGC TCCAAGCGGC AAGAAGATTC TGATTCTTGA GCAAGGGCCT
TATCTTCCCC GTGAAAAGGA AAACTGGAGC GCGACTGAGG TCTACGGCAA GGAGCGCTAC
CACACCCATG ATCAGTGGTA CGACAAACAG GGAACAGCCT TTCGACCCCA GATGAGCTAC
TGGGTAGGAG GCAACACCAA AGTTTATGGG GCTGCTTTGC TCCGTCTGCG GGAGCGCGAC
TTTGAAGCCG TTCAACATCG GGATGGGATC TCGCCCGAGT GGCCACTAAA ATACGCAGAC
TTTGAGCCTT ACTACAGTCA AGCTGAAGCC CTATTTGATG TTCACGGCGA GGATGGCAAC
GATCCCACCG CCCCACCGCG CCAAACTCCT TACCCCTATC CGGCAGTCAG TCATGAACCC
CGCATGCAGG AGATCTATGA CGGGCTGAAA GGACAAGGAC TGAAGCCGTT CCATTTGCCC
GTAGGTCTCA AACTTAACGA GACAGAGCGT GCCCTCTGGG ACTGCATCCG CTGCGATAGC
TGTGATGGTT TTCCCTGTTT GGTGCGGGGC AAAGCCGATA GTGAAGTTAA TGCCGTCACA
CCAGCGCTGA CTTACCCAAA CGTCACCCTC AAGACCGAGG CCAAAGTACT GCGGTTACTG
ACCAATGAAT CGGGCCGCGA GGTGACGGCA GTGGAAACCC AAATTGGTGG TGAGACTGTT
CTGTTCAAGG GCGATGTGGT GGTGGTCAGT TGTGGCGCTG CGAACTCCGC CGCTCTGCTG
CTGCGATCGG TGAGCGATCG CCATCCCAAT GGCCTAGCCA ACAGTTCCGA TCAGGTCGGG
CGCAACTTCA TGAAGCACTT GACGACGGCG ATGGTGGCAC TTAACGCCAA GAAAAATGAG
TCGGTCTACC AAAAAACGAT CGCAGTCAGC GACTTCTATT GGGGCGAAGA CGGTTACGAC
TACCCAATGG GCTTCATCCA AAATACGGGC AATGTGTTGC CCGACATGAT GCCGGCTGAA
GCGCCGGGAT TACTGGCATC ACTGCTCAAG TTCGTGCCCA GGCTGGATGT TGATTTAGGG
CCTCAATACC AGGCTGCGGC TCAGCACTCA GTGGGCTGGT GGTTCCAAAC GGAAGACCTG
CCTGATCCAA ACAATCGGGT GCGAGTCGTG AACGACCAAA TTCATTTGGA CTACACGCCA
AATAACACTG AGTCGACTCA GCGCTTGGTT CATCGCTGGA TTGACATTCT GAAAGCAGTC
GACCGTGCCG ATCACGTTCT TCCCTTTGAC CTCTATCCCC GCAGTAGTTC GCCCATTCAG
GTGTTGGGAC ACCAGTGCGG CACCTGTCGC TTTGGGGAAG ACCCGACAAC ATCAGTGCTC
GATCTCAACT GTCAGGCTCA CGATGTTGAC AACCTCTATG TGGTTGACAG CAGCTTCTTC
TGCTCCAGTG CTGCGGTCAA CCCGACCTTG ACGATCATTG CCAACGCTCT ACGGGTGGGC
GATCACCTGC TCGAGCGGCT GGGCTAG
 
Protein sequence
MAINQTDYDI VIIGTGAGGG TLLHRLAPSG KKILILEQGP YLPREKENWS ATEVYGKERY 
HTHDQWYDKQ GTAFRPQMSY WVGGNTKVYG AALLRLRERD FEAVQHRDGI SPEWPLKYAD
FEPYYSQAEA LFDVHGEDGN DPTAPPRQTP YPYPAVSHEP RMQEIYDGLK GQGLKPFHLP
VGLKLNETER ALWDCIRCDS CDGFPCLVRG KADSEVNAVT PALTYPNVTL KTEAKVLRLL
TNESGREVTA VETQIGGETV LFKGDVVVVS CGAANSAALL LRSVSDRHPN GLANSSDQVG
RNFMKHLTTA MVALNAKKNE SVYQKTIAVS DFYWGEDGYD YPMGFIQNTG NVLPDMMPAE
APGLLASLLK FVPRLDVDLG PQYQAAAQHS VGWWFQTEDL PDPNNRVRVV NDQIHLDYTP
NNTESTQRLV HRWIDILKAV DRADHVLPFD LYPRSSSPIQ VLGHQCGTCR FGEDPTTSVL
DLNCQAHDVD NLYVVDSSFF CSSAAVNPTL TIIANALRVG DHLLERLG