Gene Information |
Locus tag | Glov_1943 |
Symbol | |
ID | 6369343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter lovleyi SZ |
Kingdom | Bacteria |
Replicon accession | NC_010814 |
Strand | - |
Start bp | 2066629 |
End bp | 2067699 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642677355 |
Product | peptidase M42 family protein |
Protein accession | YP_001952179 |
Protein GI | 189425002 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
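The coordinates, gene length, and protein length in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon. Both assumptions fit the numbers in this record but are not stated in it, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2066629, 2067699)
print(length_bp)                  # 1071
print(protein_length(length_bp))  # 356
```

Because the gene lies on the minus strand, the start and end values describe the span on the replicon, not the direction of transcription. The length calculation is the same either way.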
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0044983 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAAGG AATCACTGGT TTTTCTGGAA AAGCTGCTGG CCGCCCCCAG CCCCTCTGGC TATGAACAGC CCGCCCAGCG ACTGTTTCGT GACTACGTTG CTCCCTACTG TCAGGTCAGC AGTGATGTAC TGGGTAATGT CTACGGCTTT ATTGCAGGTC AGGGCAAGGA TTGTCCCAAG GTGATGCTGG TGGGGCATTC CGATGAAATC GGCCTGCAGG TCAAATACAT CGATGACAAG GGGTTTCTCT ATTTTGCCGC CATTGGCGGG GTTGATGCAC ATTTAACACC GGGCAAGGTT GTCCATATCC ATACGACAGA CGGCCCACTT CCCGGTGTTG TCGGCAAGCG CCCGATTCAC TTGATGGATA CCAAAGATCG CGAAACCGTG GTCAAGCTGG AGGCCCAGTA CATTGATATC GGGGCAAAAG ACAAGAAAGA GGCCCAAAAA CTGGTGCGGG TAGGGGATTG CATCACCTTT GAGAGCGGCT TTACCCACTT GCAGGGCGAC CGGGTTGCAT CCCGTGGCTT TGATGACAAG GCCGGTTCCT TTGTGGTGGC AGAAGTGTTG CGCCTGGTGG CTGCAGAAAA GAAAAAGCTG CCGGTTGATT TGTACGGCGT GTCATCGGTA CAGGAAGAGA TCGGCCTACG TGGCGGCACC ACCAGCTGTT ATACCATCAA CCCGGATATC GGCATCTGCG TTGAAGTGGA TTTTGCCACA GACCAGCCTG ATGTTGAGCG GAAACACAAT GGTGAGGTGG CCTTGGGTAA AGGTCCGATC CTGACCCGCG GAGCCAATAT CAACCATGCC CTGTTTGAAT TGCTGTACGC CACAGCCCAG AAGGACAAGA TCGCCGTACA ATTAACCGCC AATCCCCGTG CAACCGGCAC CGATGCCAAT GTAATGCAGA TTTCCCGGGG CGGGGTAGCT ACTGCCTTGG TAAAACTACC GTTACGCTAT ATGCATACAC CGGTAGAGGT GGTTTCATTG GGAGATCTGG AGCAGGCGGC CAAGCTGATT GTGGCGACGC TGAAGATGAT TACCGACCGT GGAACGTTTG TACCGCAATA A
Protein sequence | MRKESLVFLE KLLAAPSPSG YEQPAQRLFR DYVAPYCQVS SDVLGNVYGF IAGQGKDCPK VMLVGHSDEI GLQVKYIDDK GFLYFAAIGG VDAHLTPGKV VHIHTTDGPL PGVVGKRPIH LMDTKDRETV VKLEAQYIDI GAKDKKEAQK LVRVGDCITF ESGFTHLQGD RVASRGFDDK AGSFVVAEVL RLVAAEKKKL PVDLYGVSSV QEEIGLRGGT TSCYTINPDI GICVEVDFAT DQPDVERKHN GEVALGKGPI LTRGANINHA LFELLYATAQ KDKIAVQLTA NPRATGTDAN VMQISRGGVA TALVKLPLRY MHTPVEVVSL GDLEQAAKLI VATLKMITDR GTFVPQ
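The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for turning such a field back into a contiguous string and computing its length and GC fraction follows. The helper names are hypothetical. The record does not say how its GC content value was calculated, so the simple (G + C) / length formula used here is an assumption.

```python
def join_blocks(field: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (assumed: (G + C) / length)."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence, used only as a short demonstration.
demo = join_blocks("ATGCGCAAGG AATCACTGGT TTTTCTGGAA")
print(len(demo))          # 30
print(gc_fraction(demo))  # GC fraction of these 30 bases only
```

Applied to the full Gene sequence field, the joined length can be compared against the Gene Length row, and the GC fraction against the GC content row.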