Gene Synpcc7942_2028 details


Gene Information

Locus tag: Synpcc7942_2028
Symbol:
ID: 3774215
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 2096615
End bp: 2097676
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 57%
IMG OID: 637800473
Product: glycosyltransferase
Protein accession: YP_401045
Protein GI: 81300837
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
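
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(2096615, 2097676)
print(length)                     # 1062
print(protein_length_aa(length))  # 353
```

Under those assumptions both values agree with the Gene Length and Protein Length fields.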


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.949426
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.904409
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGCAATC TGTTAGTCAA TTTGGCAATG GTGCTGCGCC AGCCCACAGG GATCAGTACC 
TATGCCCTCA GTTTGTTGCC CTATCTGCGA TCGCTGCAGC CCACCCTCCT GAGCGATCGC
CTCTACGAGG GGTTTGACCA TCAGCTCATT TCCTCGCGGC TCTGCTCCGA TCGCGGGCGA
AGAGCCCGCT TGGCTAGGCT CTATTGGACC CAGACCCAAG TTCCCAAACT CTATCGCCGG
CTGGGTAGTC GACTGTTATT TTCACCGGCG CCCGAAGCCC CGATTGATCG CCACTGCCGA
TCGGTGGTGA TGGTTCATGA TCTGCGTCCC CTGCAACTGC CATCGCACTC TTGGCAGACC
TACTATTTCC GCTGGGTCGT GCCCAAAATT GTGGCGCAGG CGAGTCATGT GCTCTGCAAC
TCGGAAGCAA CCGCTACCGA TCTCTGTCAT TTTTATCAAC TGCCCGCCCA GAAAATTACG
CCGATTTACC TGGGCTACGA TCGCCAGCAC TATCAGCCTT GGAGCGGCAA AACCAGCAAC
TACTTCCTCC ACATTGGGCA ACAGTTTCCC CACAAGAATC TAGAGCGCCT GATTCGGGCG
TTTGCCCAGT TACCGACGGA CTATCAGCTG TATCTCGCTG GCAGTCGTCA TGCCTCCGAA
ACACCTCGGT TAGAGCAACT TGTGCATAGT TTGGGACTGC GGGATCGGGT GCAATTCCTG
CGCTACGTGG ACTACGCCGA TCTGCCGCGG CTGATTGGTG AGGCGATCGC CCTCGTCTAT
CCCAGCCTTT GGGAAGGCTT TGGCCTACCG ATTCTAGAGG CGATGGCCTG CGGCACCCCA
GTGATCACTG CCCATGGCTC TTCGCTATCA GAAGTGGGCG GTGAGGCCGT TCTCTACGTC
GACCCCTATC GCATTGAGGC GATCGCTGCC GCCATGCGCG ACCTAATCGA CGATTCAAAT
CTGCGGCAGT CCCTGCGCGA TCGGGGCTTC CAGCAAGCCA GCCGTTTCAG CTGGGAAGCC
ACCGGTAAAG AAACTTGCCA AGTGCTCGAA AAATTTCTTT AG
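
A GC content figure such as the one listed under Gene Information can be recomputed from a nucleotide sequence laid out like the one above. This is a minimal sketch: the helper name `gc_percent` is hypothetical, whitespace and non-ACGT characters are skipped, and the rounding rule used by the source database is not known, so the result is returned unrounded.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops spaces and newlines
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First 20 bases of the gene sequence, spaces included as in the listing.
print(gc_percent("GTGGGCAATC TGTTAGTCAA"))  # 45.0
```

Passing the whole gene sequence as one string (line breaks included) gives the gene-wide value.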
 
Protein sequence
MGNLLVNLAM VLRQPTGIST YALSLLPYLR SLQPTLLSDR LYEGFDHQLI SSRLCSDRGR 
RARLARLYWT QTQVPKLYRR LGSRLLFSPA PEAPIDRHCR SVVMVHDLRP LQLPSHSWQT
YYFRWVVPKI VAQASHVLCN SEATATDLCH FYQLPAQKIT PIYLGYDRQH YQPWSGKTSN
YFLHIGQQFP HKNLERLIRA FAQLPTDYQL YLAGSRHASE TPRLEQLVHS LGLRDRVQFL
RYVDYADLPR LIGEAIALVY PSLWEGFGLP ILEAMACGTP VITAHGSSLS EVGGEAVLYV
DPYRIEAIAA AMRDLIDDSN LRQSLRDRGF QQASRFSWEA TGKETCQVLE KFL
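
The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG is one of the accepted alternative start codons and is read as methionine at the initiation position. The sketch below illustrates this with a hand-built codon table. It is a simplification, not the database's own pipeline: only ATG, GTG and TTG are treated as start codons, and the first codon of whatever input is given is assumed to be the initiation position.

```python
from itertools import product

# Codon assignments in TCAG order (same amino acid assignments as the standard code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Subset of the table 11 start codons (assumption: enough for this illustration).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # remove the spacing used in the listing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("GTGGGCAATC TGTTAGTCAA TTTGGCAATG"))  # MGNLLVNLAM
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (AAA TTT CTT TAG) translate to the closing KFL followed by a stop.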