Gene Synpcc7942_1201 details


Gene Information

Locus tag: Synpcc7942_1201
Symbol:
ID: 3774437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 1229078
End bp: 1230508
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 57%
IMG OID: 637799628
Product: glycosyltransferase
Protein accession: YP_400218
Protein GI: 81300010
COG category: [H] Coenzyme transport and metabolism
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
        [COG2226] Methylase involved in ubiquinone/menaquinone biosynthesis
TIGRFAM ID:
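
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1229078, 1230508)
print(length)                     # 1431
print(protein_length_aa(length))  # 476
```

Both results match the Gene Length and Protein Length fields of the record.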


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0256337
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGACC ATACCGCCGC GATCGCGGCC CATTTTGATC AGCTAGCACC CAACCTCGAT 
CGCTGGCGGC GACGCAACCG CACCTACTAT CGCGATCTCG AAAAACTGCA TCGCTTCTGG
ATTCCTACCG GATCACGAGT GCTGCAGGTT GGTTCTGGTT TGGGCGATCT GCTAGCAACG
GTTGAACCAT CCTTCGGCAT CGGGATTGAT GTCTCACCGC AGGCGGTGGC AATCGCTCAG
CAACGCCATC CACAGCTGCA GTTCCACTGC TTGGCTGCCG AAGAGTTGAC ACCCGAGGCG
ATCGGCAATC CTGAGCCTTT TGACGCGATC ATCCTGACGG GAGTGCTCAG CTACCTGACG
GATATTCAAG TTGTGCTGGA GCAGCTCCAA GCCTTTTGTC ATCCCCGCAC GCGCTTAATC
CTTGGCTTCC ATAACTTCCT CTGGCAGCCC TTGCTCACGG CCGCTGAAAA GGTGGGACAG
CGATCGCCCC AACCGCCCGA AAGCTGGCTG GGTATGCAGG ATGTGCTCAA TTTACTGACG
CTGACGGGCT ATGAACCAAT CAAGCAGGGG CGGCGCTTCC TTCTGCCTCG GCAAATTCCG
CTACTGACCG GCTGGATTAA CCGCTGGATC AGCCCGCTAC CGGTGATTGA GCATCTAGCG
CTGACTAACT ATGTGATTGC GCGGCCCCTG GCTCAACCGC GATCGCAACC CACAGTCTCG
GTGATTGTTC CGGCTCGCAA TGAGGCGGGC AATATTGCAG CAGCAGTCGA ACGCTTGCCA
GAACTGGGGG CTGAGACAGA GCTGATCTTC GTGGAAGGCC ATTCCCGCGA TCAGACTTGG
GAGACGATCG AGCAGACGGT GGCGGAGTAT CAAGGGCCGC TGAAACTGCT GGCCTGCCGC
CAAACTGGCA AAGGTAAAGC CGATGCAGTC CGGCTCGGCT TCGATAAAGC CAGCGGCGAC
ATTTTGATGA TTCTCGATGC TGACTTAACT GTGCAGCCGG AGGATCTCGG CCATTTCTAC
CGCGCGATCG CCAGTGGCAG GGGCGAATTT ATCAACGGCT CTCGGCTGGT CTATCCGCGA
TCGCGGCTGG CGATGCCGGG GCTGAATACC CTTGCTAATC GAACCTTCGC CCTGATCTTT
TCCTTCCTAC TCGGTCAGCC GCTTAAGGAC ACCCTCTGCG GCACCAAGGT GCTCTGGAAA
ACCGACTACG ATCGCGTGGC GGCAGGGCGG AAATACTTTG GTGACTTCGA TCCCTTTGGT
GACTTTGACC TACTGTTTGG TGCCGCTAAA CTCGGCCTCA AAATTGTCGA AGTACCAGTG
CGTTATCAAG AGCGCAGCTA CGGCAGTTCC AACATTGCTC ATGTCCGCGA AGGGCTGATT
CTGGCACGGA TGTGTCTCTA CGCCGCTGGC AAACTGAAGT TCCCTCACTA G
 
Protein sequence
MNDHTAAIAA HFDQLAPNLD RWRRRNRTYY RDLEKLHRFW IPTGSRVLQV GSGLGDLLAT 
VEPSFGIGID VSPQAVAIAQ QRHPQLQFHC LAAEELTPEA IGNPEPFDAI ILTGVLSYLT
DIQVVLEQLQ AFCHPRTRLI LGFHNFLWQP LLTAAEKVGQ RSPQPPESWL GMQDVLNLLT
LTGYEPIKQG RRFLLPRQIP LLTGWINRWI SPLPVIEHLA LTNYVIARPL AQPRSQPTVS
VIVPARNEAG NIAAAVERLP ELGAETELIF VEGHSRDQTW ETIEQTVAEY QGPLKLLACR
QTGKGKADAV RLGFDKASGD ILMILDADLT VQPEDLGHFY RAIASGRGEF INGSRLVYPR
SRLAMPGLNT LANRTFALIF SFLLGQPLKD TLCGTKVLWK TDYDRVAAGR KYFGDFDPFG
DFDLLFGAAK LGLKIVEVPV RYQERSYGSS NIAHVREGLI LARMCLYAAG KLKFPH
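
The sequences above are printed in space-separated blocks of ten residues. As a minimal sketch of how such a listing could be processed, the Python below joins the blocks, computes GC content, and translates the DNA. All function names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation codons; the alternative start codons permitted by table 11 are not handled here, which does not matter for this gene because it begins with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGAATGACC ATACCGCCGC GATCGCGGCC CATTTTGATC AGCTAGCACC CAACCTCGAT"
dna = join_blocks(first_line)
print(len(dna))        # 60
print(translate(dna))  # MNDHTAAIAAHFDQLAPNLD
```

The translated fragment matches the first twenty residues of the protein sequence in the record. Passing the full gene sequence through `join_blocks` and `gc_percent` would allow the GC content field to be checked in the same way.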