Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3950 |
Symbol | |
ID | 8139324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4533715 |
End bp | 4535337 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644871567 |
Product | glycosyl transferase family 39 |
Protein accession | YP_003023725 |
Protein GI | 253702536 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.00000131348 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCCAAG CGCAGCAAGC TAAGGGGCTC TGGCTCCCTT TCCTGATCCT GGCGGGGACC TGCCTCCTCT TCTCGCTGGT CCTCCCCTTC TTCCCCGTGG ACGAGACCCG CTACCTCTCC GTGGCCTGGG AGATGAGGGT GCACGACTCC TTCATCGTCC CGATCCAGAA CGCGCTCCCC TATTCGCACA AGCCCCCCCT GCTCTTCTGG CTGATCAACC TGGACTGGCT CCTCTTCGGG GTCAACGAGG GGACGCTCCG CTTCATCCCG CTCATCTTCA GCCTTTTCAA CGTCTGCCTG GTCTACCGCA TCGCCCTTGA GCTCTGGGAG GACAGCAAGC TCGCGCTGAA CGCCGCCGTC ATTCTCGGCT CCACCTTCTC CTATCTTTTG TGGTCCTCGG TGATCATGTT CGACGTGATC CTCTCCTTCT GGGTCCTCGT AGCCGTTTTC GCCTTCATCC GCGCCGGCAC AAAGATGAGG TTCGCCGACT TCGCCCTGGC GGGCGCGGCC ATAGGATGCG GCATCCTCAC CAAGGGACCG GTGGTGCTGG TCTACATCCT GCCGGTGGCG CTCTTCGCCT TCTGGTGGCA GCCCAAAGGC GAGGTCGCTC CCAAGTGGTA CGGCTTTTTG CTCCTCTGCC TGCTGATAGG TATCGCGGTG GTCCTCGCCT GGCTCATCCC GGCGGCGCTC ACCGGGGGGG AGGTTTACCG AAAGGCGATC CTTTGGGGGC AGACGGTGCA CCGCATGGCC AACTCCTTCG CCCACAAAAG GCAGCTTTGG TGGTATTTCC AGTGGATCCC GGTCCTTCTG GCCCCTTGGA TCTTCTTCGC CCCCTTCTGG CGCGGCTGCC GCCGCCTGCC CCTGGACGCG GGGACCAGGC TGGTGCTCAC CTGGATCGTT GCCGGTTTCG TCGTCTTCTC CTTTTTGAGC GGAAAGCAGG TGCACTACCT GATCCCCCTG ATCCCCGCCT GCAGCCTGCT GATGGCGAAG GCGATCGCCA GCTCCGAGGA GAGCGGGCGG CGCTTCCAGC TCCCGATCGC CGTCTTCTAC CTGGTCCTCG GCGCGGCCAT CGCGATTATC ACCTTCCTGA AGCAGGGGCG CGCGCTGCAA AACTTCGATC CCGGCGAGTT GAGAATCGCC GCCGCCGGCC TCATGCTCCT CGGCGCCGCG CTCTCCTTCC CCAAGCCGCG CGACGCCTCG GCCGCCCTCA AGCTGGTGGC GCTCTCAGCA CTCCCCTTCT TCGCCCTTGT GGCGGTCGGT TGCCACACCT TCTCCGGGCG CTACGACCTC CACGCCGTCT CGGCCGCGGT ACTAAAGAAG CAGCAGGAAG GGTATCAGGT GCTGCACCGG GGAAAATACC ACGGCCAGTA CCATTTCATG GGGCGGCTGC AACTGCCGCT GCTGCAACTG GAGGATAGCG ACGAGATCCG CCGCTACGCG CAGACCCGCG AAAAGGTGGC GCTCTTGAGC TATACGCCCG ACGACCAGGC GGTGCAGCCG GAGGAGGCCT TCTTCCGGCA GCCCTTCAGG AGCAAGCAGG TGGTCCTCTG GAACAGCCGG GGGATCCTGC AGAACCTGGA CGGCGCCAAG GCCGCCGCGA CCCCGCCCCC AGGGAAACCA TAA
|
Protein sequence | MSQAQQAKGL WLPFLILAGT CLLFSLVLPF FPVDETRYLS VAWEMRVHDS FIVPIQNALP YSHKPPLLFW LINLDWLLFG VNEGTLRFIP LIFSLFNVCL VYRIALELWE DSKLALNAAV ILGSTFSYLL WSSVIMFDVI LSFWVLVAVF AFIRAGTKMR FADFALAGAA IGCGILTKGP VVLVYILPVA LFAFWWQPKG EVAPKWYGFL LLCLLIGIAV VLAWLIPAAL TGGEVYRKAI LWGQTVHRMA NSFAHKRQLW WYFQWIPVLL APWIFFAPFW RGCRRLPLDA GTRLVLTWIV AGFVVFSFLS GKQVHYLIPL IPACSLLMAK AIASSEESGR RFQLPIAVFY LVLGAAIAII TFLKQGRALQ NFDPGELRIA AAGLMLLGAA LSFPKPRDAS AALKLVALSA LPFFALVAVG CHTFSGRYDL HAVSAAVLKK QQEGYQVLHR GKYHGQYHFM GRLQLPLLQL EDSDEIRRYA QTREKVALLS YTPDDQAVQP EEAFFRQPFR SKQVVLWNSR GILQNLDGAK AAATPPPGKP
|
| |