Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2713 |
Symbol | |
ID | 8138055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3160014 |
End bp | 3160979 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644870317 |
Product | CapA family protein |
Protein accession | YP_003022507 |
Protein GI | 253701318 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 0.292185 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCGCCG TAGGCGACGT CATGATGGGG AGCGACTTCC CCGCAGCCAA ACTCCCGAAA GACGGCGGGC GCTCGCTCTT CGCCGCCGCC GCGCCCTTTT TCAAGCGCGC GGATATCGCC ATGGCCAACC TCGAAGGTCC TCTCTGCGAG GGGGGAACAC CCGTCAAAGA GCCCCTTCCC GGCAGGCGCT ACCTCTTCCG CACCCCCCCC GCCTTCGCCA GGAACTTAAG CGACGCCGGG ATCTCCATGG TGTCGCTTGC CAACAACCAC GCCCGGGACT TCGGCAGGGA AGGGCTCGCC TCCACCAGGA AGGCGCTGGC CCGGGCCGGG GTACTCTACT CCAGCAAGAA CGGGGAGGTC GCCGAGTTCG TGGTGCGCGG CATCCGCGTC GGCATCATCT CCCTCTCCTT CGGCCCTCCC CCCCGCTCCA TCACTTTTCC GGCACAGGCC CTGCAGGAGA TCGCCCGGGA AGCGGAGAAC TACGACATCC TGATCCTCTC GATCCACGCC GGAGCGGAGG GGCGTGACGC GCTGCACGTG ACCCCGGGGA TGGAGCGCTA CCTTGCTGAG CCGCGCGGCG ACCTCTTCTC CTTCGCGCAC CAGGCCGTTG CGGCGGGGGC GGACCTGGTG GTGGCCCACG GCCCCCACGT GCCGCGGGCG CTGGAGCTTT ACCGCGGCCG GCTGATCGCC TACAGCCTCG GGAACTTCGC CACCTTCGGC GGGGTGAGCG TTGCCGGCGC GAGCGGCTAC GCCCCGCTTC TCACGGTCCG CCTGGACAAA GACGGCTCCT TCCTGGAGGG GAGCCTGGAC TCCTTCCGGC AGACGTACCT TAGCGCACCC GCTCCCGATC CCAAGCGGCG CGCGCTTTCC CTGATGAGGG GGCTCTCCGC AGAGGATTTT CCCGACTCGC CCCTAAGCTT TGGCAGCCGG GGGGAGCTCA AAACCTTTAA CAAGGAGAAC TTCTGA
|
Protein sequence | MAAVGDVMMG SDFPAAKLPK DGGRSLFAAA APFFKRADIA MANLEGPLCE GGTPVKEPLP GRRYLFRTPP AFARNLSDAG ISMVSLANNH ARDFGREGLA STRKALARAG VLYSSKNGEV AEFVVRGIRV GIISLSFGPP PRSITFPAQA LQEIAREAEN YDILILSIHA GAEGRDALHV TPGMERYLAE PRGDLFSFAH QAVAAGADLV VAHGPHVPRA LELYRGRLIA YSLGNFATFG GVSVAGASGY APLLTVRLDK DGSFLEGSLD SFRQTYLSAP APDPKRRALS LMRGLSAEDF PDSPLSFGSR GELKTFNKEN F
|
| |