Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3502 |
Symbol | |
ID | 8138874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4040570 |
End bp | 4042153 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644871121 |
Product | hypothetical protein |
Protein accession | YP_003023281 |
Protein GI | 253702092 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 2.7799500000000003e-32 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCCGGGC TTTTCGTCGC CGCCCTGCTG ATACGGCTAT ATTTCGTACC GTTTTTCAAG GTGATATCCG CCGACGGAGT TGGGTATGTA ACTGCCGCCC GGAGCCTCTC CCGGGGGAAT CTAGGCGACC TCACCATCTA CGGGGTGGTC TACCCGAGCC TTACCGCGGC GCTGAATCTG CTGACCGGCG ACATGGAACT GGCGGGGCGC TGTGTCTCGG CGTTCATGGG AAGCTTGCTG GTCGTGCCGC TCTACCTGCT TGGGGTGGAA TTCTTCTCCA AGAGGGCAGG GCTTCTGGCC TGCATCCTGG TCCTCGCCTG GCCCCCCCTG AGGATGTGGG CGGGCGAGGT GATGACGCAG GGGACCTACA TAACGCTGAT GCTTGGCGGC GTGTACGCGA TGTGGGTGGC CTTCAGAAAA GACTCCAGCC GCCTCTGCTT CGGTGCGGGC GTGCTGATGG CTTTTGCCTA CCTGACGCGC CCCGAAGCTT TGGTCACCTT CCTTGCCGTA GGCGCAGCCC TGGTGGTTCC CGCACGGGTG AAAGGGTTGT CCTGGAAAAG GATCGCCGGC CTAATCGCGG CGACAGGTGC CGGTTTCGCC ATCCCGTTGA TCCCCTACGT TTTCCTCGTG CACAGCGTCA CAGGAAAATG GCAGCTTGCG GGGAAGACCG CGAACACGCT CGCCGACGCG CTGAGCCAGT ACCTGCAGCG TCCCGACATG AAGAACGAGG CCTCCTTCAA GGGGATCGGC GTACTCGACG TGATCCGGCT CTATCCGGAG TTCCTTTGGG GGAACTTCCT CAAGAACCTG AAAGAAACCT TCCAGACCAT GGTCCCCACC TACCTGTGGC TCCTCTCCTT CATCGGAATT ATCGGCTACG GCTGGAGCAG GGAAAAGTGC GGCAGACAGC TGGTCCTCCT GGCGACCTTC GCGCCGTTAG CGGTGATCAT GGTCCTCTTC TTCGTCGGGC CCGAGTACCT GCAGCCGTAT CTCCCCGTTC TCTTTCTCTG GGCCGCTTCC GGTTTTCTGT TGCTGGAGGA GCGCCTGGCG TCGTCCTTGC GGCTGGATAG ATTCGAACTC GTCTCCAGGA TGCGAAGGGG CATCCCGGCC TCCGCGCTCG TGGCGGGGTG GATCACGATT TCTCTGCTCG TCGCACAGGT GCGGGAGATT AGCGACGAGC CGTACCACTA CTCCCAGGAC GGCGGGCGGT ACGACCAGAA GAGGATCGGT CTGCGGCTCA AGAAGTTGCT CCCTCCGGGT TCGCGGGTCA TGACCAGATG GGGGCGCATC ACCTTCTACT CCGAGATGGA GATGGTGATG ATCCCGCAGG CGGGGTATCC GGAACTGCTG GATGCCATCC GCACCAGCAA GGTGAAGTAC GTCATCGTCG ACGGGATGCT TACCGCCGCG CGTCCCCAGT TCGGCCTGCT CTACCGCCCC CTGTTCGAGA CACCGGAGAC GATCGAGTAC AGTGAAAAGG AGGCAGGCGG GGAAGCCTAC ATGCCCCTTC CCAACCTGAA GCTCATTTAC CTGCACAAGG ATCCTTCCAG CATCGGACTG GCGGTGTACG AGGTGAAGTC GTGA
|
Protein sequence | MAGLFVAALL IRLYFVPFFK VISADGVGYV TAARSLSRGN LGDLTIYGVV YPSLTAALNL LTGDMELAGR CVSAFMGSLL VVPLYLLGVE FFSKRAGLLA CILVLAWPPL RMWAGEVMTQ GTYITLMLGG VYAMWVAFRK DSSRLCFGAG VLMAFAYLTR PEALVTFLAV GAALVVPARV KGLSWKRIAG LIAATGAGFA IPLIPYVFLV HSVTGKWQLA GKTANTLADA LSQYLQRPDM KNEASFKGIG VLDVIRLYPE FLWGNFLKNL KETFQTMVPT YLWLLSFIGI IGYGWSREKC GRQLVLLATF APLAVIMVLF FVGPEYLQPY LPVLFLWAAS GFLLLEERLA SSLRLDRFEL VSRMRRGIPA SALVAGWITI SLLVAQVREI SDEPYHYSQD GGRYDQKRIG LRLKKLLPPG SRVMTRWGRI TFYSEMEMVM IPQAGYPELL DAIRTSKVKY VIVDGMLTAA RPQFGLLYRP LFETPETIEY SEKEAGGEAY MPLPNLKLIY LHKDPSSIGL AVYEVKS
|
| |