Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1418 |
Symbol | |
ID | 8136746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1667331 |
End bp | 1668488 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644869032 |
Product | protein of unknown function DUF185 |
Protein accession | YP_003021235 |
Protein GI | 253700046 |
COG category | [S] Function unknown |
COG ID | [COG1565] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 4.54059e-28 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCGATG GCGCAACTAC TAAACTTGCC GAAATCATCC TTAACCGCAT CCGCACCAGC GGGGACATAA CCTTTGCTTC CTTCATGGAC GCGGCTCTCT ATGAGCCGGA CCTTGGCTAT TACACTTCCG CGGGCCGCAA AGTCGGAGCT GAGGGGGACT TCTACACCAG CATGAACGTC CACAGCGCTT TCGGACGGCT CATCGCTCAG GAGATCTGCC GGTTCTGGGA AGTCCTCGAC TCCCCCGCCT CTTTCACCAT CGCCGAGGCA GGCGCGGGCG GCGGACAACT GGCGCAGGAC ATTCTGGACG CCATCAGCGA AGACAACCCC GCTTTCTACA GCGGCCTCAC CTATCGCCTC ATTGAGAAGG AACCTTCCCT GCAGCAGGCC CAGGCCGCGC GCCTGTCGCG CCACGCGGAC CGGCTCGCCT GGAGCTCGCC GGACGAACTC GCAGCGGGGA CGCTCTCCTT CACCGGCTGC ATCATCTCCA ACGAACTTTT CGACGCCATG CCGGTACACA TCGTGGAACT GACCGAGGCG GGACTGCGGG AGGTGTACGT ATCCGCCGAT GACAACGGCT TCGTCGAAAG GCTGCTCCCC CCGTCCACCC CGGAGTTGGA GCAGTACCTG CGCAAGTACG AGGTAAGGCT CCTGCCGGGC CAGCGCGCGG AGATCAACCT CGCCGCTTCC GGCTGGATCG CACAGGCGGC AGCCACCCTC ACCCGCGGCT TCGTGCTCAC CATCGACTAC GGCTACCTCT CCGGGGAGCT CTACACGCCG CAAAGGAAAA ACGGGACGCT TCTCTGCTAC TACAAGCACT CCACCAACGA AAACCCCTAC CAACTGGTGG GTGAGCAGGA CATCACCACC CACATCAATT TCAGCCAGCT CATCGTCGAC GGCGAGGAGT CGGGGCTCAA GAAGGCGTGG TACGGCGAGC AGTACCGCTT CCTCCTGGCC GCGGGGCTCA TGGAGGAGCT GATCAGGCTG GAGGCGCAGG CCAAAGACGA GCAGGAAAGC CTCAAGCACC GACTGGCGCT CAAGAAGCTG ATGCTTCCCG AGGGGGGGAT GGGCGACACC TTCAAGGTGC TGATCCAGTC CAAGGGGGTC GACAACCCGC AGCTTCTCTG CATGAGGAAA TGGGGGATGG GGCTGTGA
|
Protein sequence | MADGATTKLA EIILNRIRTS GDITFASFMD AALYEPDLGY YTSAGRKVGA EGDFYTSMNV HSAFGRLIAQ EICRFWEVLD SPASFTIAEA GAGGGQLAQD ILDAISEDNP AFYSGLTYRL IEKEPSLQQA QAARLSRHAD RLAWSSPDEL AAGTLSFTGC IISNELFDAM PVHIVELTEA GLREVYVSAD DNGFVERLLP PSTPELEQYL RKYEVRLLPG QRAEINLAAS GWIAQAAATL TRGFVLTIDY GYLSGELYTP QRKNGTLLCY YKHSTNENPY QLVGEQDITT HINFSQLIVD GEESGLKKAW YGEQYRFLLA AGLMEELIRL EAQAKDEQES LKHRLALKKL MLPEGGMGDT FKVLIQSKGV DNPQLLCMRK WGMGL
|
| |