Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3562 |
Symbol | |
ID | 8138934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4131300 |
End bp | 4133033 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644871181 |
Product | filamentous hemeagglutinin outer membrane protein |
Protein accession | YP_003023341 |
Protein GI | 253702152 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 132 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAG CTGTAATAGA TGCGGGAATC CATTCGGAGC AGCGTCCGGG CCGGGCGGAA GCCCCGGCGG GGCTCGAAGG GCTCAAGCCG TGCGCCGGGT TCCTCGGGAA CATGAGCGGA CGCGCGCGCA TGAACCTGAT GGCGTCCATC GTGATGACAC TGCTGTTCAC CGGATTCTGC GTCACTTCTT ATTTCATGCC GGATACGGCC TACGCGTGGC CGACCAAGTA CACCTCCTGC AGCTCCTGTC ACGCGCAGGT AGACCCGAAT GCGACCATCA CAGCCGCGAT CAACGGCGCC GTCGGCACCT CGGTGACGGT TGCGCCGGGA GGTAGCTTCG AGGTTGACTG GAAGGTTACC AACGTAACCA ACGCGGCAGG CGGTCAGGTC GGCGTGGGGG TCGAGATCGA CCTGCCGACC GGCTGGGGAC TGGCCAAAGG GACGGTGAAC GCTCCTGGCA TCCCCGGCTG GACCAGCGTG TGGGACGCGG CCAGCGGCGT GCCTGCCGGT TGGGCGACCG CCAACAGCTA CAGCACCTCG GCCGAGTTCC CGAACAGCCC GGTCGGCTAC ACCATCAACT ACGACAGCAC CGCCTGGGAC ACCGGGTCCA GGAACGCCGC CTACGACAAC GCCACCGCAG GCAAGGACCT GGACGGCATC GCCGACAACA TGGGGACCGA CGCAATCGTG ACGGTCCCAG CCGGGGCTAC GCCCGGGACC TACACCATGG TGGTCATGGG CGTTGGGCAT GACTCCGCGA AGTCGTACGT CGCGCAGGCG ATCACCGTGA CGGTATCCGG CGCAGGCGGA GATAGCGCCA AGCCGGTGGT CTCCGCGGGC TTCGCGGCAA CCACCCCGTC TCTTTCCCGG ACCATCGCCG TCTCCGGTTT CGCGGCGACC GACGACACCG GCGTCACCGG CTATATGATC ACGACGAGCG CCGCGGCGCC GCTTGCCGGC GACGCCGGCT GGCTCACCAG CGCGCCCGCC AGCTACACGG TGGCCTCCGA CGGGAGCTAC ACCCTGTACC CCTGGGCCAA GGACGCGGCG GGGAACGTAT CGCTCGCCTA CGGCGCGCCG GTCACCGTCC TTGTAGACAC GGTGAAACCG ACCGTCTCCT CCACGATTCC GGCCAACGGG GCTACGGCGA CCAACCTGAA CGGCGCGGTA ACCCTTAACT TCAGCGAGAG CGTGAACTGC GCCACGGTCA CTACCGGCAC GGTCACCATC TCCCCGGCGG TTGGCTGGAC CCGGTCGAGC TGCTCGGGAA GCCAGGCAAT CTTCACCCCG TCGGGCCAGT CGAATTCCAC CAGCTACACG GTGACGGTAG GGGCTTCCGT CGCCGACACG GCCGGGAACA CGCTGGCGGC GAGCTACCCC TTCGGCTACA CCACCTCGGC GCCGGCCCCC AACAACCCTC CGGCTCTGCC TGCCTCGCTC ACGCAGTACA AGAGCGACGG CACGACCGTT CTTTCCCGCG GTCTTTACAC CAACCTGACC ACGCTGATCT TCAAGGGGAC GCTAACCGAC CCCGACAGCG ACGCGGTGCA GCTCGACATC GAGCTTGCCG ACGTGGGGGC CGCATTCACC GGGCTGCCTA CCTGCAGCAG CACCCTGGTC GTAAGCGGTA CTACCGCCGC CGCCACATGC AGCAGCATAG CCAACGGCCG GTTCAAGTGG CAGGCCCGCG CCACCGACAG CAAGGGTTCG ACCGGCAGCT GGACGCAATA CTAA
|
Protein sequence | MKKAVIDAGI HSEQRPGRAE APAGLEGLKP CAGFLGNMSG RARMNLMASI VMTLLFTGFC VTSYFMPDTA YAWPTKYTSC SSCHAQVDPN ATITAAINGA VGTSVTVAPG GSFEVDWKVT NVTNAAGGQV GVGVEIDLPT GWGLAKGTVN APGIPGWTSV WDAASGVPAG WATANSYSTS AEFPNSPVGY TINYDSTAWD TGSRNAAYDN ATAGKDLDGI ADNMGTDAIV TVPAGATPGT YTMVVMGVGH DSAKSYVAQA ITVTVSGAGG DSAKPVVSAG FAATTPSLSR TIAVSGFAAT DDTGVTGYMI TTSAAAPLAG DAGWLTSAPA SYTVASDGSY TLYPWAKDAA GNVSLAYGAP VTVLVDTVKP TVSSTIPANG ATATNLNGAV TLNFSESVNC ATVTTGTVTI SPAVGWTRSS CSGSQAIFTP SGQSNSTSYT VTVGASVADT AGNTLAASYP FGYTTSAPAP NNPPALPASL TQYKSDGTTV LSRGLYTNLT TLIFKGTLTD PDSDAVQLDI ELADVGAAFT GLPTCSSTLV VSGTTAAATC SSIANGRFKW QARATDSKGS TGSWTQY
|
| |