Gene Information |
Locus tag | GM21_4085 |
Symbol | |
ID | 8139459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4665565 |
End bp | 4667208 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644871700 |
Product | flagellin domain protein |
Protein accession | YP_003023858 |
Protein GI | 253702669 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 136 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCCA ATGACATTTC CCTTGTGGCC GGCATGAGGG ACAACCTGCT CGAATTGACC AAGACTGCCA AACTGACCAA CCGGACGCAG GAGCGCCTGG CTTCCGGCAA GCGGGTCAAT TCCTCGTTAG ACGACCCCTC CAATTTCTTC GCCGCCATGG AATTGAAGAA CTACGCCGAC GATCTTTCCC TGGTCCACGA CAACATTAAT AACACTATTC AGACGGTAAA GGCGGCTTCC AATGGCATCA GCGCCGTCTC CACTCTCATT GCAGGTATGA AGAGTCTCGT GGACGCAGCG AGGATAACGT CGGATCCCGC GGCGAAGGCA AAGTTCGCAA GGTCGTACGA TCAGCTTTAC TCGCAGATCA ACTCGGTGGT ATCTGACAGC TCCTACAATG GCGGCAACCT GATCAGCGAC AACGACCCGA GGCTCCTCAG CTGGAGCTCT CAGAACCTTC CGGTAAGCCT GAACCCAACC GGCACCTCGG AGTACGTCGT GGAAGGAAAG TTCCTGGGTG CCGGCTACGC CTTCTACGAG GACGGAGTCC CCGCCACAGG ATGGGTCCCC AATGACCTCG GGACCAAACT GGCCCCGGTT GACGCAGATG CGACCGAGTT CCCTGTTCCC GGCAACGCCC CCACGCCCCT CGATTTCGTC TTCATCATCG ACTCCACGGG AAGCATGGGC GGCTACATCA ACATGGTGGA AGCGAACGCC AAGTCCTTCG TGAGCAACAT GGTAGCCCAG GGCGTCGACG GCAGGTATTC CTTCGTGAAG TACGGAGACG TCTCCTCGGG CGACGCCGCC GTCATCGCGG CTCCCTCTTT TTTCACCGAC CCTGACGCCT TCGCCGCCGC GATCACCGCG TCTGCCGACA GCCCTTCGGG GGGCGGAGAC TTCCCGGAAT CCGGCCTTGA GGCGATACTT GGGGCTGTTT CCGGGCTTGC CTTCCGCGCG GAGGCGACCA AGCGTATGGT GCTGCTGACC GACGCCACGG TGCATACCTT CGCCGATGGA TTCTCCGCCG AGACCATCGC CGGAACCGCC GCCGCAGTCT CAGCAGCGGG AATCCAACTC GACATCGCCA CGGTCGCCGG TGGCGTGACG CAGTTGACGC CGCTTGCCTC CGCGACCGGC GGTACCGTCT ACGATATCAA CGATGCCTCC TTTTATACCG ACAACTTCGG CGTGACGCCC GATCCCGCCA CCCCGACCCA TAACCTGACC GTCGTCTCCG CCGACTACGA TACCGACGCG TTCTCCATCC GCTTCGATCC GCCGGGGGCG ACGAAGTCGA TGACGGCTGA GAAATCGGGG TTGGGGCTGT ACCACAGTTG GGTGTATCAG AATTTCGCTA CCGAGGAGGG GCTCGCCGCC GCGACCCAGG CCCTTGACGC CGCCGACCGC ATCCTCCGCA CCGAATCCGC CAACTTCGGC TCCGGCTCGA CCATACTGGT GACCCGGGAC ACTCTGGCAT CCGAGATAGC CAACATCGTC CGGACCGGCA GCGACAACCT TACCGCGGCG GATATGAACG AAGAGGCCGC CAACATGCTC ATGCTGCAGA CCCGCCAGAG CCTTTCGACG ACCTCTTTAA GCATCGCCTC CCAGTCCTCG CAGATCGTTT TGAAGCTTTT CTAA
|
Protein sequence | MAANDISLVA GMRDNLLELT KTAKLTNRTQ ERLASGKRVN SSLDDPSNFF AAMELKNYAD DLSLVHDNIN NTIQTVKAAS NGISAVSTLI AGMKSLVDAA RITSDPAAKA KFARSYDQLY SQINSVVSDS SYNGGNLISD NDPRLLSWSS QNLPVSLNPT GTSEYVVEGK FLGAGYAFYE DGVPATGWVP NDLGTKLAPV DADATEFPVP GNAPTPLDFV FIIDSTGSMG GYINMVEANA KSFVSNMVAQ GVDGRYSFVK YGDVSSGDAA VIAAPSFFTD PDAFAAAITA SADSPSGGGD FPESGLEAIL GAVSGLAFRA EATKRMVLLT DATVHTFADG FSAETIAGTA AAVSAAGIQL DIATVAGGVT QLTPLASATG GTVYDINDAS FYTDNFGVTP DPATPTHNLT VVSADYDTDA FSIRFDPPGA TKSMTAEKSG LGLYHSWVYQ NFATEEGLAA ATQALDAADR ILRTESANFG SGSTILVTRD TLASEIANIV RTGSDNLTAA DMNEEAANML MLQTRQSLST TSLSIASQSS QIVLKLF
|
| |
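The coordinate, length, and GC content fields in this record are related by simple arithmetic, and the sketch below shows one way to cross-check them. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon (the gene sequence above ends in TAA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers written for this illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(4665565, 4667208)   # Start bp, End bp from the record
    print(length)                             # 1644, matching Gene Length
    print(protein_length(length))             # 547, matching Protein Length
    # First two 10-base blocks of the gene sequence, used only as a short demo;
    # pass the full sequence to compare against the reported GC content.
    print(gc_percent("ATGGCTGCCA ATGACATTTC"))
```

Passing the full Gene sequence value to `gc_percent` gives a figure to compare with the GC content field, which the record reports rounded to a whole percent.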