Gene Information |
Locus tag | GM21_1241 |
Symbol | |
ID | 8136566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1446701 |
End bp | 1447945 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644868855 |
Product | amidohydrolase |
Protein accession | YP_003021060 |
Protein GI | 253699871 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
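The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are hypothetical and are not part of the database.

```python
# Values copied from the record above.
START_BP = 1446701
END_BP = 1447945


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1245
print(protein_length(length_bp))  # 414
```

Both values match the Gene Length (1245 bp) and Protein Length (414 aa) fields of the record.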
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 2.29686e-19 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAATAT ACGCCGCGTC ATATCTGCTT CCCATCTCCT CTCCCCCCAT CGCCGGCGGC GCCGTCGCGG TGGAGAACGG AGTGATAGCC GCCGTCGGAA CACTGCCCGA GGTCTCCACC GTCTGCGGGG CACCGGTCAC CGATCTGCCG GGGTGCGTCA TCATGCCCGG GCTGGTCAAC GCCCACACGC ACCTGGAACT GACCCATTTC CCCGCCTGGA AGCTGCGCAA GGACCTCGAC TATCTCCCCA AGCGCTACGT GGAATGGATC CAGCAGGTGG TGAAGATCAA GCGCGCCCTT TTGCCCGGGG AGATGGAGCA TTCGATCCGG GAAGGGATCC GCCTTTGCCT TGAATCCGGC ACCACCTCCG TCGGCGACAT ACTCTCCGAC TTTTCCTTGG CCCCTCTCTA CCTCGACACG CCGCTTGCCG GCAGGGTGTT CCTGGAGGCG ATAGGGCACG ACCCCACGCA GGGGGAAAAC CTGTTGCGGC GGATCGAGAC GACGCTCGAC ATCTTCGCGG GGAGCATCCT GCCGGCGATC TCTCCGCACA CCCCCCACAC CGTGTCGTCG CAGCTTTTGC AGGCCTTGCA CGCTCTGGCC GTAAGCCGTG CCATACCGAA GGCAATCCAC CTCTCGGAAA CCGCGGACGA AGCCTCCTTC ATGCATGACA CCACCGGGGA GATCGCCGAA CTCATCTATC CCATGGCGCA CTGGGAAGAG TATCTGCCGC ACCCGATGTA CACCACCTCC ACCCGTTTCC TTTGCGATCT TGGCGTCCTC GACCGCTCCA CCCTTGCCGT CCATGCCGTG CACGTCACCA TGGACGACGT GAGACTGTTA AAGGAAAAGG GTTGCAGCGT GGTCCTCTGC CCTCGCAGCA ACGACCGGCT TTTCGTCGGC ACCGCACCGC ACAAGCTCTT GAAGAAGGCC GGAGTTCCGC TCGCCCTGGG GACCGACTCC CTGGCGAGCA ACGACTCCCT TTCTCTTTGG GACGAGGTGC GCTACCTGCA GCAGCAGGCA CAAGGCGTCT TCAGCGCCGA AGAACTCATC GCCATGGCGA CCATAGGGGG AGCCCGGGCC TTGCAGATAG AGGCGAGCGC CGGTTCCTTG GAGCCAGGCA AGCGCGCCGA CTTCCAGGTT CTTTCCTTGG GCAGCGTCAG TGAGACTTCC GTCCACGCCG CCCTTCTGTC CAAGGGGCGT CTGGAGCAGG TCTACGTCGC CGGCGAGAGG TACCCGAAAC AGTAG
|
Protein sequence | MKIYAASYLL PISSPPIAGG AVAVENGVIA AVGTLPEVST VCGAPVTDLP GCVIMPGLVN AHTHLELTHF PAWKLRKDLD YLPKRYVEWI QQVVKIKRAL LPGEMEHSIR EGIRLCLESG TTSVGDILSD FSLAPLYLDT PLAGRVFLEA IGHDPTQGEN LLRRIETTLD IFAGSILPAI SPHTPHTVSS QLLQALHALA VSRAIPKAIH LSETADEASF MHDTTGEIAE LIYPMAHWEE YLPHPMYTTS TRFLCDLGVL DRSTLAVHAV HVTMDDVRLL KEKGCSVVLC PRSNDRLFVG TAPHKLLKKA GVPLALGTDS LASNDSLSLW DEVRYLQQQA QGVFSAEELI AMATIGGARA LQIEASAGSL EPGKRADFQV LSLGSVSETS VHAALLSKGR LEQVYVAGER YPKQ
|
| |
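The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ only in which start codons are permitted). The helper names `clean`, `translate` and `gc_content` are hypothetical. Only the first 30 bases of the gene sequence are used here to keep the example short, so the GC fraction printed for this prefix is not expected to match the 63% reported for the full gene.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record above.
prefix = "ATGAAAATAT ACGCCGCGTC ATATCTGCTT"
print(translate(prefix))             # MKIYAASYLL
print(round(gc_content(prefix), 2))  # 0.4
```

The translated prefix agrees with the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same functions should reproduce the complete protein and a GC fraction close to the reported value.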