Gene Synpcc7942_1024 details


Gene Information

Locus tag: Synpcc7942_1024
Symbol:
ID: 3773952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 1038106
End bp: 1039098
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 61%
IMG OID: 637799444
Product: hypothetical protein
Protein accession: YP_400041
Protein GI: 81299833
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
              [R] General function prediction only
COG ID: [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily
TIGRFAM ID:
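
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `coding_codons` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1038106
END_BP = 1039098
GENE_LENGTH_BP = 993
PROTEIN_LENGTH_AA = 330


def span_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = coding_codons(computed_length)

# Both values should agree with the record if the assumptions hold.
print(computed_length, computed_protein)  # prints: 993 330
```

Under these assumptions, the reported gene length (993 bp) and protein length (330 aa) are consistent with the reported coordinates.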


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGATG ATCAGAGCGA TCGCCTCCGC TGTGCGGGTT TGCCCTTGAA ATTTCCTGTT 
TCGCTGCAAC TAGCGATCGC CACAGCCCTG TGGGGTGGCA CCTTTACGGC TGGTCGAATT
GCGGTGCAGC AACTGTCGCC CTTGGCTGTC GCTTGCGGTC GCTATCTGTT GGCAACCACC
GTGCTGCTGC TGATTCTTTG GCAGCGAGAG GGCTGGCCCC CACTGAATCG CCGTCAGCAG
CTATTGCTAT TCGGCTTGGG CGTGAGCGGC ATTGCCCTCT ACAACTGGCT GTTTTTCATT
GGCCTCAGCC TCATTCCCGC CAGTCGCGCG GCACTGATCA TTGCCCTCAA TCCAACCGCG
ATCGCACTGG GAGCGGCGAT CTGGACTGGC GATCGCCTGC GATCGTGGCA GTGGGCTGGG
GTAGGTCTGT CGTTGATCGG CGCAATCCTG CTTTTGGGTA GCCGTCAGGC TGGAGCACTG
ACGCTACCGG GCTGGGGTGA TCTGGCCTTA GTTGGCTGTG TCCTCTGCTG GACGGTCTAC
AGCCTGCTAG CTCGACAGGC CCTGCGATCG CTCAGTCCTC TGACCGTCAC GACTGGTGCT
TGCTGCTGGG GCAGTGTTTT GCTGATCGGA CTCTGGCTTG GGCAAGGGGC ACAGCTGCCA
GTCAACGTCT CGTTCTCGAC TGGATCAGCG ATCGCGTTTC TCGGTCTGGG TGGGACTGCC
CTAGCCTTTT GTTTGTATGC CAATGGCATC GAGCGCTTGG GGGCAGCGCG GGCCGGTCTG
TTTATCAACC TTGTGCCCGT GTTTGGTAGT GCGATCGGAG CGCTGTTGCT GCAGGAACCG
CTCTCGGGTT TGACGCTACT CGGGGGCTTG CTGGTCTTGG CAGGGGTCGG TCTGGGTACG
TTGCAGCGAT TGCAACCGGT ACCAATCTCA ACGACAGAAC CAGTGGGAGG CGATCGATCA
CCGGGCCCGC CAGCCCTTGG GCACGATCGC TAA
 
Protein sequence
MGDDQSDRLR CAGLPLKFPV SLQLAIATAL WGGTFTAGRI AVQQLSPLAV ACGRYLLATT 
VLLLILWQRE GWPPLNRRQQ LLLFGLGVSG IALYNWLFFI GLSLIPASRA ALIIALNPTA
IALGAAIWTG DRLRSWQWAG VGLSLIGAIL LLGSRQAGAL TLPGWGDLAL VGCVLCWTVY
SLLARQALRS LSPLTVTTGA CCWGSVLLIG LWLGQGAQLP VNVSFSTGSA IAFLGLGGTA
LAFCLYANGI ERLGAARAGL FINLVPVFGS AIGALLLQEP LSGLTLLGGL LVLAGVGLGT
LQRLQPVPIS TTEPVGGDRS PGPPALGHDR
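
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. This is an illustrative example under stated assumptions, not the method used to produce the record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it ignores ambiguous bases. The names `translate` and `gc_fraction` are hypothetical helpers; only the first and last blocks of the gene sequence are used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (whitespace ignored)."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 33 bases of the gene sequence in this record.
first_bases = "ATGGGTGATG ATCAGAGCGA TCGCCTCCGC"
last_bases = "CCGGGCCCGC CAGCCCTTGG GCACGATCGC TAA"

print(translate(first_bases))  # prints: MGDDQSDRLR
print(translate(last_bases))   # prints: PGPPALGHDR
```

The two outputs match the first and last ten residues of the protein sequence in this record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.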