Gene Information |
Locus tag | GM21_1245 |
Symbol | |
ID | 8136570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1452863 |
End bp | 1455325 |
Gene Length | 2463 bp |
Protein Length | 820 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644868859 |
Product | hypothetical protein |
Protein accession | YP_003021064 |
Protein GI | 253699875 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 7.16231e-20 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCAAAAA AGAAATCGGG TGGCGGTGGA GCTGCACTAA TCGGCCTGAT TGTCTTAGGC ATGATAGTGA AGTATTGGTC TGTGTTTTTC CCCTTGACTG TTCTCGGTCT GATAATTTGG GGCATAGTAA AATTGTCAAA GAATTGGTCC ACAGCTGACT CGAAATCAAG CCCCTCGCAG ACAGCATTAG ACCTGACGCG GCCAGAAAAG ACAGTAGTTC CCCCAGCTAA GACAGCCACC CCAGTCCCGA CCATAAAAAT CGAAGTAAGT ACGAGTACGA ATTACCAGTC CCCGTCATCA TCTCCGAAGG AGTCGCCCTA CGTGCCACCA TATCGGACGG ATGCCACCAT TTTCGGGGCA AGTGCCCAGC AATCAAATTC TGTATCAGCT GACTCCTTTT GGGTTCCATG TGGTCGCACG ATTCAGGTTG CAGGCTATTC GATTCCGGGC GGGATGGTTT ATCATGGCAC TGGGTTGAAG TCCGTAAATC AGTACAACGA TGAACCGGCT CTGATAAGAC CGAAGCTTAA ACTAGATTCA GCCAACCCCG ACCGGGAAGG TCGTAACATA GGGTATTGGC CATGCTATTC TCAGATTCAC CCCACATCAC GAGCTGCTTT CCTGGAGTGG CTTTCCACTG GACGAAAAGA CCCCAACACC AACATCGGCT ACGTCTTCAT CTTTTTCTAT GGCCTTGAAA GACGAGCCTT TATCGATGCC AAGGAATCTG CCGCAGCACG TAACGAAATC CCCACTATTG CAACGGAAGT AAAACGACTC CTTTCTATCT ACGGCGAAAA CAACAGCTTC CGTGGATACG CAAGCAAGTT TCTGGACGCA ATACAGTCGT CCCAAGTGAA GGCACATCTC TACCGTACAG CGCCACAGAT TGACAATGGA TGCTCCTGGG AGATTCCGCT GACCCTTAAG ATAGCCCTTG GTCAGGTAGC CAATGATGGT GTTCCTCTGC CTGCCGAATG GGCATTGGCG TGGGCAGAAA ACGACCCGTC AATGCCTCGC AGAATGCCAA GCCAGCGTTG TCAGGCCGAG TTCCGCGAGC TATTCAAGAC TCGCTACAGC GAAAAAATGG GAGAAGGCTT GAAACTGAAG CCAAACAAAA CGAGGCTCAA GGCAAGTTAC TTCCCGGCCA CTTCATCTTT TAGGGGGAAT ATTGAAATCC CGATTTTGGA TTTGCCGGAT GTTACAGAGA CTACCGGCCC AGCTAACAAG ATTCGGGACA TTGCCAATGC CTGTACTGAC GAACTGGAAA GTTATAGCCG TTATCTTGGC CGGAACCCGG AAGGAAGGAA CTCGATTGAG GCCACGTCAT ACCTTCCCCA ACCCCTGTTG ACCAAACATG CCGGAAAAGA CTTCCAGAAG CTGAGTGACT GGCTGTCTGT GCAGGTTCAC GCCGATAAGC CCGAGTGCTT TTCCTTCTCA ATGCTGTTGG AGCACATTTC GTCAATCAAG CCTGATGGTT TCGGCAAGAA AGAGGCAACT GCTATCGCTA ACCTGCTGGC CAAGATGAAA ATCGGTATTG AACCTGACCC ACGCTTTGGC AATTTCATTC CTAAAATCGG CCAGGATGTC GTTCTGTTCA AAATCAGTGA CAACGCCCCA AGTTCTCCTT CAACCGAGTT TTCTGCCGCT GCCGTCGTGC TCCACTTGGC TTCAGCTGTC GCCAATGCCG ATGGCTTCAC TGACTCTACA GAAGAGCGTC ATCTGGAAGA GCATGTCGAA ACATGGCTGC ATCTGTCACC GGACGAAAAG ACAAGACTAC GGGCTCATAC TCAGTGGTTA 
CTTTCTGCCT TTCCCGGCAT GAATGGGGTC AAGAAACGGA TTGAAGTTCT CAAGCAAGAA CAGAAGGAAT CTTTAGGACG ATTCCTTGTC GGAGTCGCGC AGGCCGATGG CTATATCGAC CCCACAGAAA TGAAGACCCT CACGAAGATT TACGAAATGC TCAGCTTGGA TACTCAGAGC CTTTACAGCC ATGCCCATGC TGCTGCTGTG GAACCTGTAA CAGTACAAAC CTCCGATTTC GTGAAGCCGC AGGGCTACGC TATCCCAACA CCTCCTCCTA AACCTTGTGA GGGCGTATCT CTTGATATGA GCGCCATCGA AGCGAAACTT GCTGAAACCG TTGCAGTGTC GGCTATTCTG AGAAATATCT TTACGGATGA TGAGCCGGTC GCAACTCAGT CATCAGGTAC TGTGGTAACT ACACCGGAGG TTTCCGTTGC TGGACTCGAC CCTGAATCAT TTACCTTCAT GCAAGTATTG GCTTCCAAGC TTGTCTGGGC CAGGGAAGAG CTGGAAGAAC TTGCTGCAGA CCATAGCCTG ATGCTAGACG GCACACTCGA CACCATCAAC GATGCATCGT TTGACCATTT TGGTGGGCCG TTCTTCGAGG GTGACGACCC CATAGAAATC AATGCTGAAT ATGCCAAGGA GATATCCGCA TGA
|
Protein sequence | MSKKKSGGGG AALIGLIVLG MIVKYWSVFF PLTVLGLIIW GIVKLSKNWS TADSKSSPSQ TALDLTRPEK TVVPPAKTAT PVPTIKIEVS TSTNYQSPSS SPKESPYVPP YRTDATIFGA SAQQSNSVSA DSFWVPCGRT IQVAGYSIPG GMVYHGTGLK SVNQYNDEPA LIRPKLKLDS ANPDREGRNI GYWPCYSQIH PTSRAAFLEW LSTGRKDPNT NIGYVFIFFY GLERRAFIDA KESAAARNEI PTIATEVKRL LSIYGENNSF RGYASKFLDA IQSSQVKAHL YRTAPQIDNG CSWEIPLTLK IALGQVANDG VPLPAEWALA WAENDPSMPR RMPSQRCQAE FRELFKTRYS EKMGEGLKLK PNKTRLKASY FPATSSFRGN IEIPILDLPD VTETTGPANK IRDIANACTD ELESYSRYLG RNPEGRNSIE ATSYLPQPLL TKHAGKDFQK LSDWLSVQVH ADKPECFSFS MLLEHISSIK PDGFGKKEAT AIANLLAKMK IGIEPDPRFG NFIPKIGQDV VLFKISDNAP SSPSTEFSAA AVVLHLASAV ANADGFTDST EERHLEEHVE TWLHLSPDEK TRLRAHTQWL LSAFPGMNGV KKRIEVLKQE QKESLGRFLV GVAQADGYID PTEMKTLTKI YEMLSLDTQS LYSHAHAAAV EPVTVQTSDF VKPQGYAIPT PPPKPCEGVS LDMSAIEAKL AETVAVSAIL RNIFTDDEPV ATQSSGTVVT TPEVSVAGLD PESFTFMQVL ASKLVWAREE LEELAADHSL MLDGTLDTIN DASFDHFGGP FFEGDDPIEI NAEYAKEISA
|
| |
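The length fields in the Gene Information table can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon (the gene sequence above ends in TGA). The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    length = gene_length(1452863, 1455325)
    print(length)                  # 2463, matching "Gene Length"
    print(protein_length(length))  # 820, matching "Protein Length"
```

Passing the full gene sequence to gc_percent gives the unrounded value behind the GC content field; the spaces between the 10-base blocks do not need to be removed first.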