Gene Synpcc7942_1083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1083 
Symbol 
ID3775033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1097799 
End bp1099175 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content59% 
IMG OID637799509 
Productglycosyltransferase 
Protein accessionYP_400100 
Protein GI81299892 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000014105 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTCTCA ATTCACCCTT GGATGCGACT GTCACCCCTT CTCCTGTGGC GTTGTCATCT 
GCTCCCGAGA TTGACTACTA CCAATACACT GCTGGTCGCC GCCGCAAGGC TGCCGTAGTC
TTGGTTGCCG TATGGGGGCT TACTGCTGCC CTGCATCTCA GCCTGTGGGG TGCCCCGATC
GTTTGGGCGA TCGCCCTGGG GTTGGGCATC CATTTCCTGC GGCTGATGGG TGCCCACCGT
CATGCTGAAG TTGTGGCGTT ACCCAGCGAT CGCCAAGACT GGCCCCAGGT TTCTCTGCTT
GTTGCTGCCA AAAACGAGGC CGAAGTCATT GAGCGCTTGG TTCACAACCT TTGTTCCTTG
GACTATCCCA GCGATCGCCT CGAGGTTTGG GTGATCGACG ATGCCAGCAC CGACGCCACG
CCCGATCGCT TAGCAGAGCT CCAAAGCCAC TATCCCCAAC TGCGGGTGCA CCAGCGACAG
GCGGGCGCTC CCGGTGGTAA ATCAGGCGCT CTCAATGAAG TTTGGCCCCA GACCCAGGGT
GAGTTCATTG CCGTCTTTGA CGCCGATGCC CAAGCTCCAG TAGATCTGCT CCAGCAGGTG
CTGCCGCGCT TCCAGCAACC TCAACTAGGA GCCGTGCAGG TCCGCAAAGC GATCGCCAAT
AGTGGCACCA ACTTCTGGAC TCGCGGCCAA ACCGTCGAAA TGATGCTCGA TGCTTACCTG
CAACAACAGC GGGTCGCGAT TGGGGGGATT GGTGAACTGC GCGGAAACGG GCAATTTATC
CGGCGAGCCG CCCTCGAGCG CTGCGGCGGC TTCAACGAAG AGACGATTAC CGACGATCTT
GATTTGGCCT TCCGGCTGCA TCTTGATCAC TGGTGGATTG ACTGTTGCAT CCATCCCGCC
GTGCAAGAAG AAGGGGTGGT TCGCAGTCTG GCGCTCTGGC ATCAGCGGCG TCGCTGGGCC
GAAGGGGGCT ACCAACGCTA CCTCGACTAT TGGCCTTGGT TGGTCCGCAA CCGTTTAGGA
CCGCAGCGGA GCGTCGACTT AGCGATTTTC TGGTTGACCC AATACATCCT GCCGACCGCT
GCTATTCCTG ACTTGGCCTT GGCGCTGATC CTGAAGCGAT CGCCCCTCTA TGGTCCTTTG
GCAGGTCTGA CCGTGGGCTT GACGCTGATC AGTCTGTTGC GTGGCATCCG GCAAGCGCGA
TCGCATGAGG CTCGCAATCC CCTGCCAATC GGACAAAGCC TGTTGGGTGT GATCTACATG
CTGCACTGGA TTCCAGTGAT GGCCTATACC ACCATGCGCA TGGCCTTCTG GCCTAAACAA
CTGCGCTGGG TGAAGACGGT TCACGTTGGC GAAACCCACT CGGCCCCAGT TGCCTAG
 
Protein sequence
MPLNSPLDAT VTPSPVALSS APEIDYYQYT AGRRRKAAVV LVAVWGLTAA LHLSLWGAPI 
VWAIALGLGI HFLRLMGAHR HAEVVALPSD RQDWPQVSLL VAAKNEAEVI ERLVHNLCSL
DYPSDRLEVW VIDDASTDAT PDRLAELQSH YPQLRVHQRQ AGAPGGKSGA LNEVWPQTQG
EFIAVFDADA QAPVDLLQQV LPRFQQPQLG AVQVRKAIAN SGTNFWTRGQ TVEMMLDAYL
QQQRVAIGGI GELRGNGQFI RRAALERCGG FNEETITDDL DLAFRLHLDH WWIDCCIHPA
VQEEGVVRSL ALWHQRRRWA EGGYQRYLDY WPWLVRNRLG PQRSVDLAIF WLTQYILPTA
AIPDLALALI LKRSPLYGPL AGLTVGLTLI SLLRGIRQAR SHEARNPLPI GQSLLGVIYM
LHWIPVMAYT TMRMAFWPKQ LRWVKTVHVG ETHSAPVA