Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1352 |
Symbol | |
ID | 8136680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1593973 |
End bp | 1595163 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644868966 |
Product | domain of unknown function DUF1745 |
Protein accession | YP_003021169 |
Protein GI | 253699980 |
COG category | [S] Function unknown |
COG ID | [COG3287] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1.3322200000000001e-19 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGTACGT CAGTTGGAAT AGGCTTCAGC ACCCGCAAAA GTCCGATGGA TGCCGGGAGT GAGGCGGCCC GGACGGCACT GGAGCAGGCC GGCATCGCCA AGCCGGATCT CGTCATCGTC TTCGCCACCG TCGGCTATCA GCAGCAGCAG TTGATCGGCG CCATACGCGA GACGACGTCC GGAGCCCCGC TTTGCGGTTG TTCCGGCGAG GGGATCATCG TGCAGAATGC CGTGGTGGAA ACGAATTTCG CAGTAGGTGT CATGGTGATC GCCTCCGACG AAGTAAGCTT CGAAATCACC TCCGTCACCG GGATAGGGGA ACAGCCTCGC GCCGCCGGCA AGAAGCTGGC AGCTAAGATC ACTCCGCTTC TGGCAGACGA CAGCCGCGCC TTTTTCCTGC TGGCCGACGG CCTGGCATTT GATTTCGATC CCTTTGTGGC AGCCTTCGAG GCGGCTCTCC CCTCCTCCGC CAGGCTCCCG ATTTTCGGCG GACTGGCCGC GGACAACTGG GCTTCCCGCA GGACCTACCA ATACCACGAT GACGAGATAC TGAGCCAAGG TGCCTGCTGT ATCGTCATGT CGGGCAGGTG CCGCGTCGCC TGGGGCATCA ACCACGGCTG CGTTCCGGTC GGGGCGAAGC GAACCATCAC GCGCAGCAAG GGAAACATCA TCTACGAGAT CGATGGGACC CCCGCCTTGG ATGTGCTCAA GGAATACGTA GACGAGGACT GGGCGGTCAA GTGGAACAAG GCGACGCTGA ACCTGGGTCT CGGATTCAAG ACACCGGAGC ACCTAAGGGA GCGGTACGAC GAATACTACG TGCGCTACAT AATGGCAAAG GACGACGAGG CAGGCTCCGT GACCATCCAG TCCGACGTCA CGGACGGAAC CCCCCTCTGG ATCGTCCGCC GCGACAAGGA ACTGATGATG GAAGGCCTCA AGTCCATTCC ACGACAGATA AAAGATGACC TTGCCGGAGA AAAGCCCAAA TTAGTGTTGC AGTTCGAGTG CATGGGACGG GGCCGGGTCG TATTCAAGGA GCAGGAACGC ATGGATCTGG TGAAATGCCT CCAGCGCGAC CTGGGCGAGG ACCTGCCTTG GCTCGGATTT TATACCTATG GGGAAATAGG CCCGATCAGC AGGTACAACT GCTTCCACAA CTTCACCTCA GTCGTAGCGG TAGTTTACTA A
|
Protein sequence | MGTSVGIGFS TRKSPMDAGS EAARTALEQA GIAKPDLVIV FATVGYQQQQ LIGAIRETTS GAPLCGCSGE GIIVQNAVVE TNFAVGVMVI ASDEVSFEIT SVTGIGEQPR AAGKKLAAKI TPLLADDSRA FFLLADGLAF DFDPFVAAFE AALPSSARLP IFGGLAADNW ASRRTYQYHD DEILSQGACC IVMSGRCRVA WGINHGCVPV GAKRTITRSK GNIIYEIDGT PALDVLKEYV DEDWAVKWNK ATLNLGLGFK TPEHLRERYD EYYVRYIMAK DDEAGSVTIQ SDVTDGTPLW IVRRDKELMM EGLKSIPRQI KDDLAGEKPK LVLQFECMGR GRVVFKEQER MDLVKCLQRD LGEDLPWLGF YTYGEIGPIS RYNCFHNFTS VVAVVY
|
| |