Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3565 |
Symbol | |
ID | 8138937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4140021 |
End bp | 4143353 |
Gene Length | 3333 bp |
Protein Length | 1110 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644871184 |
Product | hypothetical protein |
Protein accession | YP_003023344 |
Protein GI | 253702155 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01905] doubled CXXCH domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 121 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTAAAA AAATGGTCAT GATCCTAGTG TTGCTTCTGG CCATGCCGGC CTTTGCCAAC GCATGGTATG TCAATGCGAA GACTTCGCCG ACCAGCGGTG CGGGCACCAT CACCCCGTCG GGCAACAGGA CCTATGCCGC GGGGGTGAAC AGCGAGGAGT TCACCGTCAC TCCGGCTCCG GGCTACACCC TTTCCCGCGT CACGCTCGAC GGCGTCGCCA TCGCCCCCAA CGCAAACGGG AAGTACGTCG CACCTTACGT CTCGACCCTG ACCTGGCGCT ACATGGTGGC AGTCTTCTCG GCCGGCACGG TGAACATCAC TACGAGCGTT ACCAGTGGGG GCGCCATTCG CGAGGCCAAC TATCTCTCGC TCACCTCCAT CCCGGTCGGT TCCACCCGGA CCCTCTTGGT ACTTCCGAAC AGCGGCTACG AAATCAGCAC CTTGACCGCC AGCGGCACCC CTTCCGTAAC CGTCCAGGGG GATGGCTCCC GGTTGGTCAC CTACAACAAC CTGCAGGCGA ACCAGAGCGT CACTGCCGGC TTCTCGCTGA TCCCGATCGT GGTCGCCAAC GCCGGCAGCG ACGTCACCAC TACGGGCACA GGCGCGGCTT ACGCGACGAC CCTCTTCGGC AGCAACAGCA GCAGCAACCA GGGGGCGATC AACTACCAGT GGAGCGGACC GGCGGCTTTG ATGTTCGGCT CGCCCACTTC CGCCAATACC ACCGTATATT CGGACATCCC CGGGGATTAC ACGGCGACAC TCACCATCAC CTCGAACGGC ATCACCCGTT CCGATACGGC CATCGTGCAT GTCGTCACTC GGAACTCCTA TCTCGTGTCT CAGTGCACCA CGTGCCATTC CGGAAACACC ACGGCGCTGG TCGGCCTTTA CAACAGCTCG CCGCATCTTG AAACCAACGC CTGCCAGGGC TGCCACACCG ACAGCCCGCA CGTGGCGCTG CCGTCGCCCA ACGTCTGCGC CGATTGCCAC ACGGACACCT CGCGCCATCC CTTCGAGATC ACCGACACCT GCACCTCCTG CCACAACTCG CACTCGACCG TTGTCGGGGC GACGGCGATG CCGGACGGTT TGCATTACAA CAACATCACC ACCGGCATGT ACCCCGCCTC CTTCGTGACC TCGCGCGCCG CCTGCGCCAA CTGCCACAAC AACACGATTT CCAACAAGGC GATCAGGATA CAGTGGCGCG CGGCGCGCCA CGCTAACATC ACCTCCATGG GGTGGATCGC CAGGGACTTC AAGACCCTTA ACGGCTGCGT GCGCTGCCAC ACCACCACCG GCTTCATCGC CTACTCCACG GGCAAGGTGA CGGCGGCGTG GGGCGTGGCC GCAGACAAGA CCAAGGAAGT ACTTACCTGC ATCGGCTGCC ACAGCGACAG CGTGTCCGGG GCGGTGCGCA CCATGGCGCA GGTTGCGCCG TACCCCGACA ACAGCTTCGT CACCCCCGAC ACCGGCAAGT CCAACGTCTG CCTCCCCTGC CACTCGGGCA CCAGCAGCGG CGAGAGCATC AAGGCGCTTT TGCAGGCCCA GGCAGACTTC GGCAACACAG GCTTCGTGAA CCCGCACTAC AAGGCGGCCA CCGGCTCCCT CTACGGCGTG GCCGGCTACC ACTTCTCGGG GCGCAGCTAC ACCACCGAAG CGACCCACAA CCATCTGGGC ATCAGCGACG GCGGCGGGGC CTGCGTCTCC TGCCACAGAA ACAGCATGAA CGGCCACACC TTCCAGGGTG AGGTGACCCC GGTCTGCGCC ACCTGCCACG GCACCAGTCT GGACGAGGCC TCCCTGCACG TGGACCACAA CTACTTCCTG AACGCCCTCG AGGTGCTGAG GGCTCAGTTG GCCGCCAAAG GCTACGCATA TTCGCTCACC AGCCGCAGCT TCAACGCCAC CAACTGGGGC GCCGGTCAGG GGGGCGCCGA CACAATGGGC GCCGCGTTCA ACTACGCGCT GCTCGTCTCC GAGCAGGCCG CCTTCGTGCA CAACCCGAAA TACGCCCAGG AACTGGTGAT CGATTCCATC GATTACCTGG ACAACGGCCA GTTCGACGAT TCAGTGGCCG GCACCGTGCA GGCCCTGCTT GACTCGGGAG CGATCAGCCA GGAGGTCGCC GACAGTTTCG GCACCTACAA GCAGAAGAAC ATCTGCCTCT CCTGCCACGG CGGCGACGCG ACCACCTCGC GCCCCATGGC CAGCAACGGT CATCCTTCGC ACCTGAGCGG GGCCTGGGGG CCGCAGGATT ACCTGCGCAC GCAGAGCTCG AGCTGCGAGC CCTGCCACGG CAACGACTTC GCCCTGCACT CCAACGGCAC CGTCAACGTG TCGAGCGACG CGGGCTCCTC CTGCGCAAAC TGCCATGCCG GTTCGGTTCC CGCCTGGAAC TCCACGGCCC GGATCGCCTG CGAGGTCTGC CACTCGGCGA ACCCGGCAAG GCTTCCCAAC GGAGTGGCGG CGCCTTCCAA GGAGAGTTTC GCCACCTCCG GCCACGGGCA GTTCGGCGCC AGCAACCAGT GCACCATCTG CCACAACCCC GACAGCCGCC ACATCTCCGG CAGCCTCGGA ACCTACAAGA GGCTGAAGCT GCAAAACGAC AACAACCTCT GCGCCTCCTG CCACGACAGC GTGGTGGCGC AGCAGATGCC GACTCACCAT GGCCTTGCCT GCGTGCAGTG CCACGATCCG CACGGAAACG GCAACATCAA GATGGTACGG GGCACGATCG GGACGCAGAG CATCACCTAC CTGAACTCCT TGAACAACTT CGTGGACCAG ACTACCAACA AGGGGCTTTG CCAGGTCTGC CACAGCTCCA CCCGCTACTA CCGCGCGGGG ATCAGCGAAA GCAGGCACTA CACTACTGGG TGCCTTGGCT GCCACTTCCA CTACAACCCC GACGGCGCCT TCTTGCCCAG CGGCGGCGCC TGCGACTCCT GCCACGGTTA TCCGCCCGCT CCGAAGAACA CGGCGACCAG CTTTGGCAGC GACTCCAACT GGGCCAACGC TCGCTACGAG GACTATTCCG GAGGCGGCGG CGCGCACCTG GTGGCGGCCC ACGTCTCCCC CTTCGCCACG ACGACCGACG GCTGGAGCAA CTGCACCGTC TGCCACAACG GCGGCTACCA TGAAATGACC ACACCGGTGG CGGAACACAT CGGCAACGTC ACCGTGATGG TGGACAACAG CCTGCGCTTT GCCGACAGCT TCACGGTCTA CACCGGCGCG AAGCTGACCA ACGCGGGGCC GAACGCCACG GGAAGCTGCT TCAACATCGC CTGCCACATG AGCCCGTCAA TAAGGTGGAG TACGGAAAGA TAG
|
Protein sequence | MFKKMVMILV LLLAMPAFAN AWYVNAKTSP TSGAGTITPS GNRTYAAGVN SEEFTVTPAP GYTLSRVTLD GVAIAPNANG KYVAPYVSTL TWRYMVAVFS AGTVNITTSV TSGGAIREAN YLSLTSIPVG STRTLLVLPN SGYEISTLTA SGTPSVTVQG DGSRLVTYNN LQANQSVTAG FSLIPIVVAN AGSDVTTTGT GAAYATTLFG SNSSSNQGAI NYQWSGPAAL MFGSPTSANT TVYSDIPGDY TATLTITSNG ITRSDTAIVH VVTRNSYLVS QCTTCHSGNT TALVGLYNSS PHLETNACQG CHTDSPHVAL PSPNVCADCH TDTSRHPFEI TDTCTSCHNS HSTVVGATAM PDGLHYNNIT TGMYPASFVT SRAACANCHN NTISNKAIRI QWRAARHANI TSMGWIARDF KTLNGCVRCH TTTGFIAYST GKVTAAWGVA ADKTKEVLTC IGCHSDSVSG AVRTMAQVAP YPDNSFVTPD TGKSNVCLPC HSGTSSGESI KALLQAQADF GNTGFVNPHY KAATGSLYGV AGYHFSGRSY TTEATHNHLG ISDGGGACVS CHRNSMNGHT FQGEVTPVCA TCHGTSLDEA SLHVDHNYFL NALEVLRAQL AAKGYAYSLT SRSFNATNWG AGQGGADTMG AAFNYALLVS EQAAFVHNPK YAQELVIDSI DYLDNGQFDD SVAGTVQALL DSGAISQEVA DSFGTYKQKN ICLSCHGGDA TTSRPMASNG HPSHLSGAWG PQDYLRTQSS SCEPCHGNDF ALHSNGTVNV SSDAGSSCAN CHAGSVPAWN STARIACEVC HSANPARLPN GVAAPSKESF ATSGHGQFGA SNQCTICHNP DSRHISGSLG TYKRLKLQND NNLCASCHDS VVAQQMPTHH GLACVQCHDP HGNGNIKMVR GTIGTQSITY LNSLNNFVDQ TTNKGLCQVC HSSTRYYRAG ISESRHYTTG CLGCHFHYNP DGAFLPSGGA CDSCHGYPPA PKNTATSFGS DSNWANARYE DYSGGGGAHL VAAHVSPFAT TTDGWSNCTV CHNGGYHEMT TPVAEHIGNV TVMVDNSLRF ADSFTVYTGA KLTNAGPNAT GSCFNIACHM SPSIRWSTER
|
| |