Gene Synpcc7942_1901 details

Gene Information

Locus tag: Synpcc7942_1901
Symbol:
ID: 3775264
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 1973458
End bp: 1975839
Gene Length: 2382 bp
Protein Length: 793 aa
Translation table: 11
GC content: 43%
IMG OID: 637800342
Product: putative glycosyltransferase
Protein accession: YP_400918
Protein GI: 81300710
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
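
The start, end, and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed values) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1973458, 1975839)
print(length)                  # 2382
print(protein_length(length))  # 793
```

Both results match the Gene Length and Protein Length fields of this record.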


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0537767
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.403992
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGATTG GCATTATTGC TCCCAGCTCT GTCCCCTTCT GCATTGGTGG AGCTGAAAAA 
TTTTGGTGGG GATTACAACG AGCCTTTAAT GATCTAACAC CTCACGCAGC TGAATTAATT
AAGCTACCAT CGCCTGAAAA TAATATCTAC GACTTAATTC AAAGTTATCA ACGGTTTTCA
CGATTTAATC TGAATCATTT TGACCTGCTA ATAAGTAGCA AATATCCTGC TTGGATGGTT
AATCATCCCC GCCATCTTTG CTATCTTCAA CATCGTCTTC GTGGCCTTTA TGATACTGAT
AATGGTCATC AAGGAAATTT GAACTTAGCT CGCTATTCTC AACTCCAAAA AATTGAGAAA
ATCCTTGAAA AAAATGGCGA TCGCTCCCAG CTAGAGCTTT TATTCTCTGA ATTGAACGAT
TTCTTGAAAA ATGAGCCTAG TGATTCTAAG CTTTTAGAGT TCCCTGGACA ATTGATTCGG
AAAGTTATTC ACTTTTGCGA TCGCATTGCC TTGAGCCCTA CTGAGATTGC TGGTTACGCT
GCGATTTCCC AAAATGTTGC CCAGCGCAAA AACTACTTTC CTGATTCGGT ACGTCCGAAA
GTAATTTATC ATCCCTCTGA TTTAGAAAAA TTCTATTGTG ATAAACAAGA TTACTTTTTT
ACTATTAGTC GTTTAGATAA CGCCAAGCGA GTCAGTTTGC TGATTTCTGC AATGCAGCAA
GCTAAAACAG ATATTCCATT ACTGATTGCA GGGACTGGAC CTGAATCAGA GAGTCTTAAA
AAACAAGCTG GAAATGATTC CCGAGTTCGA TTCTTAGGTT TTGTGAAAGA TCGAGAGGTA
ATTGATCTCT ATGCCAATGC TTTAGCTGTC CTTTATGTTC CCTATGACGA AGATTATGGG
CTAGTTCCTA TTGAAGCCTT TCGCTCTGGT AAGCCTGTTA TTACAGTTAC GGATGCTGGT
GGCCCTTTAG AGTTTGTTCA AAACTTGACA ACGGGTTGGG TCGTGCCTCC AAAACCTGCA
GCGATCGCTC AAGCAATTGA TGACTGCAGC CAGAACCCTA GTCGTGCAGC GGAGTATGGC
AAGGCGGGGC AAGCGATCGC AGAAACGATT ACATGGGAAA AGACTGTTCA AGAACTGCTA
GATCTAGTCG AACCCAAATT ACAGCCTGCT ACACCGCGAC GATCGCAGCT TGTAATTGGA
TCAACCTTCC CTGTTACCCC ACCCCGGAGT GGCGGGCAAA GCCGAATTTT TAATCTTTAT
CGAGCACTCA GTCATGAATT TGAGGTTGAA TTGATCAGCC TTGGCCCAGT TAATAGTGAG
CCCAAAACCT ATCAAATCAA TCCAAACTTC TGTGAAGTTG TTATTCCTAA ATCACCGATC
CATCAAAAAC TAGAAAGTCG GCTCGAGCGT CAAGCTGGTA TTTCAATTGG AGATCTCGCT
GCAACTTTTT ATGGCGATCG CACACCGCAA CTGCTTGAAG TGATTGAGCA GCGACGATCG
CAGGCTGATG TTCTGATTGC TTCTCATCCT TACTTACTGC CTTGGCTACA GCCCAATTCC
AGGGCTAATA AATTAGTCTA TGAAGCACAC AACTGTGAGT TTGACCTCAA GCGGGGTCTG
TTTCCCCAAA ACCGTAAAGG AACGAATTTA ACCCAAGAAG TACTAAAGCT GGAAGAAAAT
GCTTGTCAAC AAGCTGATCT GATTGTGGCT TGTCTAGAGG CTGATTGGGA GCGGCTCTGC
CATCTGTATC ACTGCCAAAA GACACCTTAT GTTTTAGTTC CTAACGGAGT TAACTGCCAA
GAAGTTACTT TTGTTGATTC CGCCCTGCGA TCGCAGTGGA AACAAAAGAT TGGTTATTCA
TCCTTTATTT TTCTATTTAT CGGAAGTTGG CATCAACCGA ATGTCGAAGC TGCCCACAGC
ATTGTCAGTT GGGCTATCGA CTTCCCCGAG CAACAGTTTT TAATTGTGGG TAGTGTTGGC
CACGCGATTG AGCAAGAGTT TAGACGAATT CCTCCTAATA TTCATTGCTT GGGTGAAGTG
GATGCACGGA CTAAGCAAGT GGCTCTTAAT GTCGCTGATG TAGCACTGAA TCCTATGACG
TCTGGTTCCG GTTCTAACCT AAAAGTAGTG GAATATTTAG CCGGTGGCTT ACCGCTCATT
ACTACTGAGT TTGGAGTTCG AGCATTGCCT TTAGAACTGC AGGAACAGTG TCAGATTGGT
GCTTTGAATG AGTTTCCAAT GCTGATGCAA AAAGCGATTG ATCAACCAGA CTTACACGAT
CCTATCGCCC GCAGAAATGC TCGCTATATC GTTGAGCAAA AACTTGATTG GCGAGCGATC
GCCAAGGACT ACGCGATCGC CCTCAAAGAA TTGTTGAAAT AA
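
The sequence above is displayed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction, which could then be compared with the GC content field of this record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line of the gene is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = (
    "ATGCAGATTG GCATTATTGC TCCCAGCTCT "
    "GTCCCCTTCT GCATTGGTGG AGCTGAAAAA"
)
seq = clean_sequence(first_line)
print(len(seq))                     # 60
print(f"{gc_fraction(seq):.2%}")    # GC fraction of this fragment only
```

To check the whole gene, pass the full sequence block to `clean_sequence` instead of the single line used here.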
 
Protein sequence
MQIGIIAPSS VPFCIGGAEK FWWGLQRAFN DLTPHAAELI KLPSPENNIY DLIQSYQRFS 
RFNLNHFDLL ISSKYPAWMV NHPRHLCYLQ HRLRGLYDTD NGHQGNLNLA RYSQLQKIEK
ILEKNGDRSQ LELLFSELND FLKNEPSDSK LLEFPGQLIR KVIHFCDRIA LSPTEIAGYA
AISQNVAQRK NYFPDSVRPK VIYHPSDLEK FYCDKQDYFF TISRLDNAKR VSLLISAMQQ
AKTDIPLLIA GTGPESESLK KQAGNDSRVR FLGFVKDREV IDLYANALAV LYVPYDEDYG
LVPIEAFRSG KPVITVTDAG GPLEFVQNLT TGWVVPPKPA AIAQAIDDCS QNPSRAAEYG
KAGQAIAETI TWEKTVQELL DLVEPKLQPA TPRRSQLVIG STFPVTPPRS GGQSRIFNLY
RALSHEFEVE LISLGPVNSE PKTYQINPNF CEVVIPKSPI HQKLESRLER QAGISIGDLA
ATFYGDRTPQ LLEVIEQRRS QADVLIASHP YLLPWLQPNS RANKLVYEAH NCEFDLKRGL
FPQNRKGTNL TQEVLKLEEN ACQQADLIVA CLEADWERLC HLYHCQKTPY VLVPNGVNCQ
EVTFVDSALR SQWKQKIGYS SFIFLFIGSW HQPNVEAAHS IVSWAIDFPE QQFLIVGSVG
HAIEQEFRRI PPNIHCLGEV DARTKQVALN VADVALNPMT SGSGSNLKVV EYLAGGLPLI
TTEFGVRALP LELQEQCQIG ALNEFPMLMQ KAIDQPDLHD PIARRNARYI VEQKLDWRAI
AKDYAIALKE LLK