Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2714 |
Symbol | |
ID | 8138056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3161181 |
End bp | 3163175 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644870318 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_003022508 |
Protein GI | 253701319 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 0.793769 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCCGC AAACCGCACT CTTCACGACG CTACTGGTCG CCTCGCTCGC CTTTTTCTGC TGGAGCGTGT ACCGCCGCTT TTCACTGGTC TGCTTCGGGC AGCCCGAGGA AAGGCTCGAC CACCCCGGGC GCCGCCTGCA GGAGATGCTT TTGTACGCCT TCGGGCAACT GCGTGTGGTG AAAAAGCCTT TCGGCCTGAA CCACTTCGTC ATCTTCTGGT CCTTCCTGAT CCTGGCCATC GCCAACGCCG AATTCCTTTT GAACGGCGTG TTCCCGTCGG CGAGCCTCGC CGTGCTGCCG CAGGGGTTGC ACCACGGGCT TCTCGTGCTC TTCGACCTGG TCTCGCTTCT GACTCTCGTA GCGATCGCGC TCTCCTTCGG CAGGCGCCTC ATCGTGAAGC CCCCCTATCT CGACTCCCTC TACGTGAAGG GGAGTAGCCC GGAGGCCTTC GTCATCCTCT CCTTCATAGC GCTCCTGATG CTCGCCTACT TCGGGATGCA CGGGGCGCAG ATCGCGCAGG GTAAGGAGGC GGTCTCGGCT GCGATGCCGG TATCGAGCTT CGTCGCGTCG CTTTTATCCC CCTACCCGGG GCTTCTGGGG ACGGTCGAGA CCGTCTCCTG GTGGCTGCAC GCGCTGGTGC TTCTAGCCTT CATCTGCTTC TTGCCCCATT CCAAGCACAT GCACATCTTG ACCGCCATCC CCAACTGCTA CCTGGGTAGC CTCGACTGGC CGGCGACCCA GCCGCGCGAG AAGTTCGAGA AGGGGGCCGA GTACGGTGCG GGGAGCGTGG AGCGCCTCAC CTGGAAGGAC CTCTTCGACT CCTTCTCCTG CACCGAATGC GGTCGCTGCC AAGCCGCCTG CCCCGCTGCC AGCACCGGGA AGGCCTTGAA CCCGCGCCAG ATCGTGCACG CCATCAAGAC CAACCTCCTG GAGAACAGCC ACGCGCTCAG GGAGGGGAGG AAGGGGACGC TGCCTCTCAT CGGCAACGAA GGGGAGGGGA CCAACACCGA GGAGGCGATC TGGGACTGCA CCACCTGCGG CGCTTGCATG GAGGCCTGCC CGGTGCTGAT CGAGCAGATG CCCAAGATCG TCAAGATGAG AAGGCACCTG GTGCAGGACG AGTCGCGCTT CCCGGAGGAA CTCCTGAACC TGTTCGAGAA CATGGAACAG CGCTCCAACC CCTGGGGGAT CGCCCCCAGC GAGCGGAGCA AGTGGGTCTC CACCCTGGAG GTGAAGCCGT TTGTCGCAGG GGAGACCGAG TATCTCCTCT ACGTCGGCTG CGCCGGTTCC TTCGACTCGC GCGCGAAGCA GGTGACCGTG GCGCTTGCCA GCGTGCTGAA CGCGGCCGGG GTCTCCTACG GGATACTGGG GAAGGAGGAG AAGTGCTGCG GCGATTCGCT GAGAAGGCTC GGCAACGAGT ACGTCTTCGA GAAGATGGCC CTGGAGAACG TGGAACTCTT CCGCGAGAAG GGTGTCACCA AGGTGATCAC CCTCTGCCCG CACTGCCTGA CCACGCTTAA AAACGACTAC CGCCAGTATG GGCTGGAGCT GGAGGTGCTG CACCAGTCCC AGCTGATCGC CGAGCTTCTC GCATCCGGCC GCATCAAGCT GGACGGCTCG GAGAAGAGCC TCGGCAAGAT CACCTACCAC GACCCCTGTT ATCTCGGGCG CCACAACGGC GTGTTCGACG CGCCGCGCGG CGTGATTCAA GCGGCTACAG GAAGCGCCCC GCAGGAGATG GAGAGAAACG GCAGGAACTC GTTTTGCTGC GGCGCCGGCG GCGGGCGCAT GTGGATGGAG GAGTTCACCG GCGAGAGGGT GAACCACGCC CGCGTCGCCG AGGCGCTTGA AGGCTCCCCC GACACCATCT GCGTCGCCTG CCCCTACTGC ATGACCATGA TCGAGGACGG CCTGAAGGAC AAGGGGGCTG GCCAGGTGAG GGTGAAGGAC GTGGTCGAGG TGGTGGCGGA AGGGCTTTTG CACCGCAAAG GGTAG
|
Protein sequence | MMPQTALFTT LLVASLAFFC WSVYRRFSLV CFGQPEERLD HPGRRLQEML LYAFGQLRVV KKPFGLNHFV IFWSFLILAI ANAEFLLNGV FPSASLAVLP QGLHHGLLVL FDLVSLLTLV AIALSFGRRL IVKPPYLDSL YVKGSSPEAF VILSFIALLM LAYFGMHGAQ IAQGKEAVSA AMPVSSFVAS LLSPYPGLLG TVETVSWWLH ALVLLAFICF LPHSKHMHIL TAIPNCYLGS LDWPATQPRE KFEKGAEYGA GSVERLTWKD LFDSFSCTEC GRCQAACPAA STGKALNPRQ IVHAIKTNLL ENSHALREGR KGTLPLIGNE GEGTNTEEAI WDCTTCGACM EACPVLIEQM PKIVKMRRHL VQDESRFPEE LLNLFENMEQ RSNPWGIAPS ERSKWVSTLE VKPFVAGETE YLLYVGCAGS FDSRAKQVTV ALASVLNAAG VSYGILGKEE KCCGDSLRRL GNEYVFEKMA LENVELFREK GVTKVITLCP HCLTTLKNDY RQYGLELEVL HQSQLIAELL ASGRIKLDGS EKSLGKITYH DPCYLGRHNG VFDAPRGVIQ AATGSAPQEM ERNGRNSFCC GAGGGRMWME EFTGERVNHA RVAEALEGSP DTICVACPYC MTMIEDGLKD KGAGQVRVKD VVEVVAEGLL HRKG
|
| |