Gene Information |
Locus tag | GM21_1035 |
Symbol | |
ID | 8136357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1216481 |
End bp | 1218466 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644868646 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_003020854 |
Protein GI | 253699665 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
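The coordinate and length fields above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(1216481, 1218466)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values match the Gene Length (1986 bp) and Protein Length (661 aa) rows.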
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 89 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGCGA CGCGGGAAAT TTACTGGAAC GTCAATCACG CTCTGATCTG GGTGATGTAC CTCTTCGCCT TTCTGGCCCT GGCGGCTTGC GCCTGGGGCT TTTGGCGGCG TCTCCCCATG TACCGGCAGG GAAAGCAGCC GCTTAACCGG CTGGACCGGC TCCCCGAGCG CGTCCGGCAC TTTCTCAAGG GGATGTTTTC GCAGGTGAAG GTGCTGCGGG TGCCCGAGCC GGGGACGCTG CATGCATTTT TCTACTGGGG GTTCCTGCTC CTTTTCATCG GGACCCTCTT GATCATGCTG CAGGCCGACT TCACCGAGCC CTTGTTCGAC ACCGTGTTTT TGCAGGGGAA TTTCTACCGC GGCTATTCCC TGGTGCTGGA TCTGGCGGGG CTAGCGGCGA TCGTCATGCT GGGGGGGCTT TTGGTGCGCC GCTGGTTCGT GAAGCCGAAA GGGCTCCCGA CCGGCGGGGA CGATTACCTG GCGCACGCCC TTCTCTTCGC CATCCTCGTG ACGGGGTTCG TCGTGGAGGG GCTCCGCATG GCCTCGACCG AGATCGGGAT CAACCCGGAA CTGGCGCGCT GGTCGCCGGT AGGGGGGCTC TTCGCCCGTC CCTTCGTCGG GATGGATCTT GGGCGGCTTT CCCTGATCCA CAAGACTCTT TGGTGGGGGC ACCTGTTCCT GGCGCTCTTC TTCATCGTTG CCATTCCCTT CACCAAGCTG CGGCATCTGT TCACCACGCC GGTCAACTAC CTCTTTACCG ACTTAAGGCC CAAGGGGGCG ATCGCGACCA TCGACCTGGA GGACGAGGGG GCGGAGCAGT TCGGCGTCGC CAAGGTGACG GATTTTTCCT GGAAGGACCT CTACGACCCC GATGCCTGCA CGGTCTGCAA GCGCTGCCAG GACCGCTGCC CGGCCTGGAA CACGGAAAAG CCGCTTTCCC CGATGCATGT GGTGCTGCAG ATAGGGGAGG TGGCGGCGGC GACGCCGCAG GCGGATCTCT GCCGGACCGT CACCGAGGAG GTCCTTTGGG ACTGCACCAC CTGCCGGGCC TGCCAGGAGA TCTGCCCGGC CGAGATCGAG CATGTGAACA AGATACTCGA GATGCGCAGG AACCTGGCGC TCATGGAAGG CTCCTTCCCC GGCGAGGAGG TGCGCGTGGC CATGGCCAAC TACGAGGTGA ACGGCAACCC CTTCGGCATG GCCTACGCCG AGCGCGGCGC CTGGGCCGAA GGCCTCGACG TCGCCGTCAT GGAGAGCGGC GCCGCGGTCG ACGTCCTCTA CTTCGTCGGC TGCTACGCCT CCTTCGACCG CAGGAACCAG GAGGTGGCCC GCGCCTTCGT GAAGCTCTGC AACGCCGCCG GCGTCAGGGT CGGCATCCTC GGCAAGGAGG AGAAGTGCTG CGGCGAGCCC CCCAGGAAGC TCGGGAACGA GTACCTGTAC CAGGGGATGG CGCAGGAGAA CATCGAGAAG ATCAAAGGGT ACGGGGTGCC GCGGGTGGTG ACCACCTGCC CGCACTGCTT CAACACCCTG GCCAGGGATT ACCGCGATCT GGGCTTCGAC ATCCCGGTCG AGCATTACAC CACCTTCCTC CATGACCTGG TGCAGCAGGG GAGGCTGAAG CTGAAAGCGG AGCCGTTTGC CTGCACCTAT CACGATTCCT GCTACATAGG GCGCTACATG GACATCTTCG AGGAGCCGCG CGAGCTTTTG GATCGCGCCG GCGCTAGCAT CGCCGAAATG GGAGCGAGCC GCCTGGAGAG CTTTTGCTGC GGCGCCGGCG GGGGGCGCAT CCTGGCGGAG 
GAGAAGCGCG GCACGCGGAT CAACGTGGCG CGGGTGCGGA TGGCGCAGGA AACCGCCGCT CCCATGCTGG TTTCCAACTG CCCGTTCTGT CTCACCATGT TCGAGGACGG CATCAAGACC GGAGGCGCTG AGGGGACGGT CGCCGCAAGG GATCTGGCGG AGATTCTCGC GGAGCGGATC GCCTGA
|
Protein sequence | MEATREIYWN VNHALIWVMY LFAFLALAAC AWGFWRRLPM YRQGKQPLNR LDRLPERVRH FLKGMFSQVK VLRVPEPGTL HAFFYWGFLL LFIGTLLIML QADFTEPLFD TVFLQGNFYR GYSLVLDLAG LAAIVMLGGL LVRRWFVKPK GLPTGGDDYL AHALLFAILV TGFVVEGLRM ASTEIGINPE LARWSPVGGL FARPFVGMDL GRLSLIHKTL WWGHLFLALF FIVAIPFTKL RHLFTTPVNY LFTDLRPKGA IATIDLEDEG AEQFGVAKVT DFSWKDLYDP DACTVCKRCQ DRCPAWNTEK PLSPMHVVLQ IGEVAAATPQ ADLCRTVTEE VLWDCTTCRA CQEICPAEIE HVNKILEMRR NLALMEGSFP GEEVRVAMAN YEVNGNPFGM AYAERGAWAE GLDVAVMESG AAVDVLYFVG CYASFDRRNQ EVARAFVKLC NAAGVRVGIL GKEEKCCGEP PRKLGNEYLY QGMAQENIEK IKGYGVPRVV TTCPHCFNTL ARDYRDLGFD IPVEHYTTFL HDLVQQGRLK LKAEPFACTY HDSCYIGRYM DIFEEPRELL DRAGASIAEM GASRLESFCC GAGGGRILAE EKRGTRINVA RVRMAQETAA PMLVSNCPFC LTMFEDGIKT GGAEGTVAAR DLAEILAERI A
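The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as shown in this record, in 10-base blocks separated by spaces, and that GC content is the plain fraction of G and C bases; how the source database rounds its reported percentage is not stated here. `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence, used only as a small demo;
# pass the full Gene sequence field to check the whole gene.
first_blocks = "ATGGAAGCGA CGCGGGAAAT"
print(round(gc_percent(first_blocks), 1))
```

Stripping whitespace before counting matters here, because the block spacing would otherwise inflate the sequence length.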