Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0576 |
Symbol | |
ID | 8135891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 704906 |
End bp | 706186 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644868193 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_003020408 |
Protein GI | 253699219 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.0000000399762 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCAAGA AGCAGATCGA CCCGCTCGAG CGGGTGGCAG CAGAGATGAA GAAATGCGTG AAGTGCGGCG CCTGCCGCGC ACACTGCCCC GCCTTCAGCA CCTTCCAGCG CGAACCGGCC ACGGCCCGCG GCAAGGTGGC CCTGGCGCAG CACCTTTGCA AGGGGGACAT CACGCTCGAC GACGGCACCT ATTCGGCCAT GTCCAAGTGC CTCCTGTGCG GCAGCTGCGT GGACAAGTGC CCGAATCAGG TCCCGACCGA CGAGATAGTG ATCGCCGCTC GCGAGGCGCT CGCCCAAAGG CGCGGGCTCA CCACCTTTCA TAAGGCTGTG GGACAGGTGA TCAGGAACCG CAAGGTGATG AACTTCGGCG CGCTCGCGGC CGCCATCCTC GGCCCCCTCT TCTTCAGGAA GGTCCCGGCC ACCTCGGGGC TCAGGCTCCG TTTCCCCATG CCGTTTATCG GGGGGAGCCG GCACATCCCG CAAATCGCCA AGAAGCCGTT TATAAAACGC CACCCCGAGG TGATCCAGGG GGAACCGGGC AAGCCGAGGA TCGTCTTCTT CGTCGGCTGC ATGACCAACT TCGTCTACAC CGAGATCGGC GAGGCGACCC TCGCCCTCTT CCGCCACCTC GGCTGCACCG TCATCATTCC CAAGGGGCAG CAGTGCTGCG GCCTGCCGGG GATGTCGGGC GGCGACCTAA ACACGGTGCG GGAGCTGGCC GAGATGAACC TCGCCGCCAT CGAGGCGCAC CAGGCGGACT ACGTGATGAC CGCCTGCGCC ACCTGCGGCG GCGCTCTGCA CAAGCTCTAC CCGCTGGTGG TGGGAAAACG GAACCCGGAA CTGAAAGAGC GGCTCCAGGC CCTGGCCGAC AAGACGGTGG ACGCGGCGGT GCTCTTGCAG AAACTGGGTC TCACCCCGGA AGAAACCGGG ACCGGCTCCG GCATGCGCAT CACCTACCAC GACCCCTGCC ACCACAGAAC CGCCGGCATC GCCAAGGAGC CGCGCGCCCT GCTCAAGAAG ACGCCGGGAC TGGAGCTGGT GGAAATGGAA GGCGCGGATC GCTGCTGCGG CCTGGGCGGA ACCTTCAACG TTTACCATTA CGAAAACTCC CTCGACATCA ACGCAGGCAA GAGCGCCGCG ATCATCGCAA CCGGCGCCGA CGCTGTGGTT ACCGGCTGCC CCGGCTGCAT CATGCAGCTC TCCGACGGCC TGAAACAGGC CGGAGATAAA ACGAGGGTAT TGCATACCGT GGAACTTCTG GCCCGCAAAA TCAGGCGCTG A
|
Protein sequence | MTKKQIDPLE RVAAEMKKCV KCGACRAHCP AFSTFQREPA TARGKVALAQ HLCKGDITLD DGTYSAMSKC LLCGSCVDKC PNQVPTDEIV IAAREALAQR RGLTTFHKAV GQVIRNRKVM NFGALAAAIL GPLFFRKVPA TSGLRLRFPM PFIGGSRHIP QIAKKPFIKR HPEVIQGEPG KPRIVFFVGC MTNFVYTEIG EATLALFRHL GCTVIIPKGQ QCCGLPGMSG GDLNTVRELA EMNLAAIEAH QADYVMTACA TCGGALHKLY PLVVGKRNPE LKERLQALAD KTVDAAVLLQ KLGLTPEETG TGSGMRITYH DPCHHRTAGI AKEPRALLKK TPGLELVEME GADRCCGLGG TFNVYHYENS LDINAGKSAA IIATGADAVV TGCPGCIMQL SDGLKQAGDK TRVLHTVELL ARKIRR
|
| |