Gene Information |
Locus tag | GM21_3077 |
Symbol | |
ID | 8138427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3569102 |
End bp | 3570790 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644870681 |
Product | hypothetical protein |
Protein accession | YP_003022863 |
Protein GI | 253701674 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
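The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither convention is stated in the record. The function names are hypothetical and exist only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 3569102
END_BP = 3570790
GENE_LENGTH_BP = 1689
PROTEIN_LENGTH_AA = 562


def span_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the fields agree: 3570790 - 3569102 + 1 = 1689 bp, and 1689 / 3 - 1 = 562 aa.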
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0000000000627479 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTAGAAA GCAACGTATT TGAGCAGCTT GCACTATGGA ACAGCGAAAA AAATATTTAT CAAGGATACG AGCTTGGTGC CGTTTACACC CGTCATGAAA CCGTTGATTT TATTCTCGAT CTCGCCGGGT ATTTAACCAA ACTTCCTCTT CATCAATTTA CTCTTCTGGA GCCTTCATTC GGGAATGGCG ATTTTCTCGT GCGTGCGGTG GAAAGGCTGC TCACCTCATA TTTCGATAAT CACGGCATGA ACACGGCCCT TAATGACTTG AAGCACGCGA TCCAAGGGTT TGAAATAGAT CCCGTAGCCG CTGAGAGCAC TGTGAAAAGC CTGTCGCAGC TGTTGGCTCG CTTCGGTTTT GTCGAGACGC AAATCTGTAC CCTGCTGGGG GGGTGGCTGA AAGCAGAGGA TTTCCTGCTA GCCGACATTC ATAACGCCTT TGATTTCGTC GTCGGCAATC CGCCTTATGT TCGGCAGGAG ATGATTCCTA GCTCCCTCAT GGCGGAGTAT CGATCCCGTT ATGAAACGAT CTATGACCGC GCCGATCTTT ATGTCCCATT CGTTGAGCGG AGTCTCCATC TACTGAAACC CGCTGGCCTC TTGGGTTTCA TCTGTGCTGA TCGGTGGATG AAGAACAAAT ATGGCGGGCC ACTTCGGTCG CTGGTGGCAA ACGGTTTCCA CCTAAAATAC TACGTTGACA TGGTTGATAC AGACGCTTTC CTCACTACCG TCAGCGCCTA TCCGGCCATT TTCGTTTTAT CCCGGGAGAA GCAGGCCAAA ACACGGGTCG CTCATCGTCC AAAAGTAGAA GCGCCAGTGC TTCAGAGACT CGCTGAATCG TTTGTGGGTG ATCATCCCGC AGAAACGCCG GTAATTGAAC TGATCAACGT CGTTAATGGC GCGGAGCCAT GGATTTTGGA CTCTCTGGAC CAACTTGCCT TGGTCCGAAG ATTGGAAGCT ACCTTTCCTT TGTTGGAAGA TGCTGGCTGT AAGGTGGGCA TAGGTGTCGC AACAGGTGCT GACCGGGCCT ATATAGCTCC GTTTGCCACT ATGGATGTCG AGCCGGACCG GAAGCTGCCT TTGGTGACGA CGAAAGACAT CGTGTCTGGG AAAGTGAGAT GGCAAGGGTT GGGAGTCCTG AATCCGTTTG ATGATCTCGG TTCCTTGGTT GATCTCGAGA AATACCCTCG ACTGAAGCGG TACCTCGAAG ACAGGCGAGA TGTTATTTCT GCTCGAAACT GTGCTAAGAG AAATCCGAAG ATGTGGTACC GCACCATTGA TCGTATATAT CCGGAGTTGA GGAAGAAAGA GAAACTGCTG GTACCGGACA TCAAAGGTGA AGCCAACATC GTTTATGAGT CGGGTGAATA CTACCCGCAC CACAATTTGT ATTACATAAC GTCTTCGGAG TGGGATCTTG GAGCGCTCCA AGCTGTTTTA CGGTCAGGGA TAGCGCGTCT TTTCGTAGGG GTCTACTCGA CCCAGATGCG TGGAGGGTAT CTAAGGTATC AGGCGCAATA TTTGCGGCGC ATTAGGCTGC CGCGTTGGGC GGACGTGCCG GATGACCTTA AGGAAAAGCT GAAGGCGGCA GGACAACGGG TGAATCCTGA GGAGCTGAAC CAGTTGGTCT TCCAGCTGTA CAAATTATCA GATAAGGAAA GTGTTGTTAT CGGTGGTTGT GGGAGTTGA
|
Protein sequence | MLESNVFEQL ALWNSEKNIY QGYELGAVYT RHETVDFILD LAGYLTKLPL HQFTLLEPSF GNGDFLVRAV ERLLTSYFDN HGMNTALNDL KHAIQGFEID PVAAESTVKS LSQLLARFGF VETQICTLLG GWLKAEDFLL ADIHNAFDFV VGNPPYVRQE MIPSSLMAEY RSRYETIYDR ADLYVPFVER SLHLLKPAGL LGFICADRWM KNKYGGPLRS LVANGFHLKY YVDMVDTDAF LTTVSAYPAI FVLSREKQAK TRVAHRPKVE APVLQRLAES FVGDHPAETP VIELINVVNG AEPWILDSLD QLALVRRLEA TFPLLEDAGC KVGIGVATGA DRAYIAPFAT MDVEPDRKLP LVTTKDIVSG KVRWQGLGVL NPFDDLGSLV DLEKYPRLKR YLEDRRDVIS ARNCAKRNPK MWYRTIDRIY PELRKKEKLL VPDIKGEANI VYESGEYYPH HNLYYITSSE WDLGALQAVL RSGIARLFVG VYSTQMRGGY LRYQAQYLRR IRLPRWADVP DDLKEKLKAA GQRVNPEELN QLVFQLYKLS DKESVVIGGC GS
|
| |
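The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the method used to produce this record. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TGA, even though the gene lies on the minus strand of the replicon). It uses the amino acid assignments of translation table 11, which match the standard code, and it does not handle alternative start codons such as GTG or TTG. `translate_cds` is a hypothetical helper name.

```python
import itertools

# Codon table built in TCAG order; translation table 11 shares these
# amino acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna):
    """Translate a coding sequence; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    # Drop a single trailing stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate_cds("ATGTTAGAAA GCAACGTATT TGAGCAGCTT"))  # MLESNVFEQL
    # Last nine bases of the gene sequence above, including the stop codon.
    print(translate_cds("GGGAGTTGA"))  # GS
```

Pasting the full gene sequence into `translate_cds` should yield the 562-residue protein listed above, provided the assumptions hold.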