Gene Synpcc7942_0809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_0809 
Symbol 
ID3775987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp803073 
End bp804815 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content59% 
IMG OID637799226 
Producthypothetical protein 
Protein accessionYP_399828 
Protein GI81299620 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTCGA GCCGCTCACC AAGGATTATG GGCCCGCGCC GACCCGCGAT CGCGATCGCC 
CTCGGTCTGC TGCTGTTAGC TTTCCTGATC TTGGTGGGGT TGAGTCTTGG CTCCTCAACA
AGTCTGATGT CTCACGATGA GGGCTATTAC GCCCTGCAGG CCCGTTGGAT TGTGGAAACG
GGTGATTGGG TAACGCCGCG CTGGTGGCAA GAGCCACTCT ACGACCGCAC AATCGGGGTG
CAATGGCTGA TCGCTGCGAG TTACAAGCTG TTTGGCTTCT GCACCACTGC CGTCCGCCTA
CCGGCTTTGC TCAGTGGACT GGCAACGCTC TGGTTGACCT TTGCGATTGG CGATCGCCTC
TTGCCTCGTC CCCAAGCCCT GTTGGCGGCG GGCATTCTGC TAGTGACGCC CCTTTGGTTT
CAGTACGCGC AACTAGCAAC CCAAGATATG CCGTTGCTAG CGGTCGAGTT GCTCTCGATT
TGGGCGCTCC TACAAGCCGT CTCGGGCGAT CGCCGAGCTA ATCTCTGGGG CTTTGTGGCG
GGTTTGGGGG TTGGCCTTGG CTTTTTGATC AAAGGCTTCA TGATTGGCGT GCCACTGCTT
GCGATCGCTC CTTGGTTTTT CTGGTATGCG CCGAAGCTAC TGCGCAATCG TGGCCTCTGG
CTTGGCCTCA TCGTCGGCTG GATTCCGGTC GGGATTTGGC TCTGGGGCAG TCAGCAGCGC
TGGGGTGATC TCGCGATCGC CCAACTCTTC GACAAATTTT TCTTTCTGGC CAGCGAAGAT
CTCTACAGCC AGCCTTGGAC TTTCTACCTC TGGAACTTGC CGCTCAATGC TTTCCCATGG
CCACTGTTTG GGCTAATTGG CTGGGTTCGC CTCTGGCTGC GACCGGAACG CGATCGTGAT
TTACAGCGGC ACTATCAATG GCTACTGGGT GTCTATCCGC TGCTACTATT GCTGATTCTC
TCCAGCTTTC GCACCCGCAC GCCTTACTAC GCCTTGCAGC TGTTGCCCTG GGTGGCTTTG
CTGGCAGCAA TGAGCTTGAG CTGGCTGGCG ACCAGTCTGA AGCCATCCTC TGGATTTAGC
TTGAGTGCTC GTCAGCCGAC TCATCGCTGG ACTGCAATGC TGAGCTGGAC CTTTGGCGGA
TTGGGACTGG TGTTGGTACT CGCTGCGATT GCCCTGCTCT CGGGTCAAAT TTCAGCCCTT
GCCGATCCGA GTTTGCGTCC CTATGGCTGG GTGGCGATCG CGCTAGGGCT GGGCTGGCTA
ACTCTGCCGA TTGTCTATAG CCAGCGGCAA CAACTGCGGA AAGCCAGTCT GCTTTGGTGC
TGTGGCTGGC TGCTCGGACC CTGGTTGGGG CTAGCCACCG TCAGCCATTG GCACCTGCTG
AGCGATCGCA GTCCCGTAAC GCGCTACGCA CTGCAACAAC CGGCAGTGCA AGCTCTATTA
CGAGAAGCAC CCGTCAATTT TTGGGCGATC GATCCGGTGG ATGGCACAAC TCATCAGCAG
TGGATTCAAC TGGCCCTCAA CAGTCCACGC TTGGGTCAGC GCCTGCAGAC GATTACCGAT
CGGCCAGCGG GCGATCGCGT TTGGGTTGCC CCTGCCCAAG TCCCCGCCTT GCCAGACAAC
TGGCAGCACC GTGCCTCCAT GCAGGGCTGG GTTCTCGTGG AAGCCGTGCT AGCACCGGCC
CCCCAAGTCG CTGTGCCAGT GGAACCTGGG CCCCCTCCCG AGACTGAACC CCAGACGCCC
TAA
 
Protein sequence
MFSSRSPRIM GPRRPAIAIA LGLLLLAFLI LVGLSLGSST SLMSHDEGYY ALQARWIVET 
GDWVTPRWWQ EPLYDRTIGV QWLIAASYKL FGFCTTAVRL PALLSGLATL WLTFAIGDRL
LPRPQALLAA GILLVTPLWF QYAQLATQDM PLLAVELLSI WALLQAVSGD RRANLWGFVA
GLGVGLGFLI KGFMIGVPLL AIAPWFFWYA PKLLRNRGLW LGLIVGWIPV GIWLWGSQQR
WGDLAIAQLF DKFFFLASED LYSQPWTFYL WNLPLNAFPW PLFGLIGWVR LWLRPERDRD
LQRHYQWLLG VYPLLLLLIL SSFRTRTPYY ALQLLPWVAL LAAMSLSWLA TSLKPSSGFS
LSARQPTHRW TAMLSWTFGG LGLVLVLAAI ALLSGQISAL ADPSLRPYGW VAIALGLGWL
TLPIVYSQRQ QLRKASLLWC CGWLLGPWLG LATVSHWHLL SDRSPVTRYA LQQPAVQALL
REAPVNFWAI DPVDGTTHQQ WIQLALNSPR LGQRLQTITD RPAGDRVWVA PAQVPALPDN
WQHRASMQGW VLVEAVLAPA PQVAVPVEPG PPPETEPQTP