Gene Information |
Locus tag | Synpcc7942_0809 |
Symbol | |
ID | 3775987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 803073 |
End bp | 804815 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637799226 |
Product | hypothetical protein |
Protein accession | YP_399828 |
Protein GI | 81299620 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
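The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (the record does not state this, but it is consistent with the listed Gene Length) and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 803073
end_bp = 804815
gene_length_bp = 1743
protein_length_aa = 580


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 804815 - 803073 + 1 = 1743 and 1743 / 3 - 1 = 580.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```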
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTCGA GCCGCTCACC AAGGATTATG GGCCCGCGCC GACCCGCGAT CGCGATCGCC CTCGGTCTGC TGCTGTTAGC TTTCCTGATC TTGGTGGGGT TGAGTCTTGG CTCCTCAACA AGTCTGATGT CTCACGATGA GGGCTATTAC GCCCTGCAGG CCCGTTGGAT TGTGGAAACG GGTGATTGGG TAACGCCGCG CTGGTGGCAA GAGCCACTCT ACGACCGCAC AATCGGGGTG CAATGGCTGA TCGCTGCGAG TTACAAGCTG TTTGGCTTCT GCACCACTGC CGTCCGCCTA CCGGCTTTGC TCAGTGGACT GGCAACGCTC TGGTTGACCT TTGCGATTGG CGATCGCCTC TTGCCTCGTC CCCAAGCCCT GTTGGCGGCG GGCATTCTGC TAGTGACGCC CCTTTGGTTT CAGTACGCGC AACTAGCAAC CCAAGATATG CCGTTGCTAG CGGTCGAGTT GCTCTCGATT TGGGCGCTCC TACAAGCCGT CTCGGGCGAT CGCCGAGCTA ATCTCTGGGG CTTTGTGGCG GGTTTGGGGG TTGGCCTTGG CTTTTTGATC AAAGGCTTCA TGATTGGCGT GCCACTGCTT GCGATCGCTC CTTGGTTTTT CTGGTATGCG CCGAAGCTAC TGCGCAATCG TGGCCTCTGG CTTGGCCTCA TCGTCGGCTG GATTCCGGTC GGGATTTGGC TCTGGGGCAG TCAGCAGCGC TGGGGTGATC TCGCGATCGC CCAACTCTTC GACAAATTTT TCTTTCTGGC CAGCGAAGAT CTCTACAGCC AGCCTTGGAC TTTCTACCTC TGGAACTTGC CGCTCAATGC TTTCCCATGG CCACTGTTTG GGCTAATTGG CTGGGTTCGC CTCTGGCTGC GACCGGAACG CGATCGTGAT TTACAGCGGC ACTATCAATG GCTACTGGGT GTCTATCCGC TGCTACTATT GCTGATTCTC TCCAGCTTTC GCACCCGCAC GCCTTACTAC GCCTTGCAGC TGTTGCCCTG GGTGGCTTTG CTGGCAGCAA TGAGCTTGAG CTGGCTGGCG ACCAGTCTGA AGCCATCCTC TGGATTTAGC TTGAGTGCTC GTCAGCCGAC TCATCGCTGG ACTGCAATGC TGAGCTGGAC CTTTGGCGGA TTGGGACTGG TGTTGGTACT CGCTGCGATT GCCCTGCTCT CGGGTCAAAT TTCAGCCCTT GCCGATCCGA GTTTGCGTCC CTATGGCTGG GTGGCGATCG CGCTAGGGCT GGGCTGGCTA ACTCTGCCGA TTGTCTATAG CCAGCGGCAA CAACTGCGGA AAGCCAGTCT GCTTTGGTGC TGTGGCTGGC TGCTCGGACC CTGGTTGGGG CTAGCCACCG TCAGCCATTG GCACCTGCTG AGCGATCGCA GTCCCGTAAC GCGCTACGCA CTGCAACAAC CGGCAGTGCA AGCTCTATTA CGAGAAGCAC CCGTCAATTT TTGGGCGATC GATCCGGTGG ATGGCACAAC TCATCAGCAG TGGATTCAAC TGGCCCTCAA CAGTCCACGC TTGGGTCAGC GCCTGCAGAC GATTACCGAT CGGCCAGCGG GCGATCGCGT TTGGGTTGCC CCTGCCCAAG TCCCCGCCTT GCCAGACAAC TGGCAGCACC GTGCCTCCAT GCAGGGCTGG GTTCTCGTGG AAGCCGTGCT AGCACCGGCC CCCCAAGTCG CTGTGCCAGT GGAACCTGGG CCCCCTCCCG AGACTGAACC CCAGACGCCC TAA
|
Protein sequence | MFSSRSPRIM GPRRPAIAIA LGLLLLAFLI LVGLSLGSST SLMSHDEGYY ALQARWIVET GDWVTPRWWQ EPLYDRTIGV QWLIAASYKL FGFCTTAVRL PALLSGLATL WLTFAIGDRL LPRPQALLAA GILLVTPLWF QYAQLATQDM PLLAVELLSI WALLQAVSGD RRANLWGFVA GLGVGLGFLI KGFMIGVPLL AIAPWFFWYA PKLLRNRGLW LGLIVGWIPV GIWLWGSQQR WGDLAIAQLF DKFFFLASED LYSQPWTFYL WNLPLNAFPW PLFGLIGWVR LWLRPERDRD LQRHYQWLLG VYPLLLLLIL SSFRTRTPYY ALQLLPWVAL LAAMSLSWLA TSLKPSSGFS LSARQPTHRW TAMLSWTFGG LGLVLVLAAI ALLSGQISAL ADPSLRPYGW VAIALGLGWL TLPIVYSQRQ QLRKASLLWC CGWLLGPWLG LATVSHWHLL SDRSPVTRYA LQQPAVQALL REAPVNFWAI DPVDGTTHQQ WIQLALNSPR LGQRLQTITD RPAGDRVWVA PAQVPALPDN WQHRASMQGW VLVEAVLAPA PQVAVPVEPG PPPETEPQTP
|
| |
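The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate a CDS. It rests on two assumptions: that the blocks are purely a display convention, and that translation table 11 assigns the same amino acids to codons as the standard code (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids listed in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final codons of the gene sequence in this record.
head = "ATGTTTTCGA GCCGCTCACC AAGGATTATG"
tail = "CAGACGCCC TAA"
print(translate(head))  # MFSSRSPRIM
print(translate(tail))  # QTP
```

Both fragments agree with the start (MFSSRSPRIM) and end (QTP) of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.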