Gene Information |
Locus tag | Glov_1902 |
Symbol | |
ID | 6367019 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter lovleyi SZ |
Kingdom | Bacteria |
Replicon accession | NC_010814 |
Strand | + |
Start bp | 2028176 |
End bp | 2030071 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642677310 |
Product | transglutaminase domain protein |
Protein accession | YP_001952138 |
Protein GI | 189424961 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
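The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the terminal stop codon (the listed gene sequence ends in TGA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2028176
END_BP = 2030071
GENE_LENGTH_BP = 1896
PROTEIN_LENGTH_AA = 631


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the interval spans 1896 bp and the 632 codons, less one stop codon, give the 631 aa listed in the record.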
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.37601 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTAGAT CGCTACTGAC AACCTGTGCT ATTGTTGCCT GTTCTGTTGT TATGATGTCT CTGGTTGCCT CTGCCAAGAC TCTGATACTT GAAGGTAACC TTGATGGTGC CGTTGCCATG CGCCAGTCCA TGGAGTTTTC AGTCAATAAG GGCACGGTCT CCAGTTTTTC ATTCAAGTTT GCCTTGCCGG CCTCTTTTTC AACCAGGACT GTTAGCCAGG GGGTTGACGG CCTTGATATT GGTCTGTCTC CTCAACCCAC CAGTTCAACG GTTGAAACCG ACCGTTTTGG CAACAAATTT CAGCGGGTCA GCTGGAATAA CGTTAACCAG GATATCCGGG TTAATCTGAA CTATACTGCC AATGTCCGTA CGCAGTTGAC CTCGCTGGAG AGCAAGACAC CTTTTCCGCT TGGCAGTCTT GCCTCAAGGG AGTCACTCTA TCTGCAGCAT ACCGAGATGG TGCAGGGGGA TGCCGAGATC AAATCCCTGG CCCGTCAGTT GACCAGTGGT GTCAAGAACG AGTATGAGGC CGTATCTGCC ATCATTAACT GGGTCACCGA CAATATCAAA TATACCTTTA ACCCGCCCCA GTATGATGCC TCCTACACCC TCTCAACCAG AAGCGGCAAC TGTCAGAACT TTGCCCACCT TTCCATTGCC CTTTTGCGCA GTGTGGGAAT CCCCGCCCGT ATTGTCGGCG GCATAACCCT GAAAGAGGGC TGGAAGGTGC CGATTGATGC AAAAAACTCC ATTGTCCAGA GCATGGGGCA GGGCGGACAT GCCTGGCTGG AGGTCTACTT CCCCGACCTG GGCTGGTTGC CGTATGACCC TCAACAATCA CGTCAGTTTA CCTCATCGCG CCATATCAAA CAGACCCATG GTCTGGACTC CCGGGATATT AATGACACCT GGAAGGGCGG CCCCTATCTG CCGGCCTACA GTGAGCTGAT AGAGGGGCGT TATACCACTG ACGATATCAA GTTGAAGATG AAGCGCTATG CCAGCGCTCC CCGTCCCTAT ATCCTGAGTA ATGCTGTGAT GGCTGGTTCA GCCGGCGTCG TGGATACAGC AGATTCCGGT CGGGAGACAC CTCCGGCTGC CAAGCCGCCC CGTCCGGAAT CGAGGACAAA TGTTGCATCA GCCAAGCCGG CAACACCTCC GCAAGGATCA GGTGCCGGGC CGGCAGAGAC CCGTCTGGTC AGTGATGACG AGGCATCTGA TGAGGATCAG CCCAGACCGG TCAAACCCAA ACCTCCCGTA AAGCCGGTCA AGCCGCCTGT AACCAAACCG CCCAAGGTCG CTACCAAACC ACCGGTAGTT GTTGAGCCCG GCCACAAGAA ACCACGTCCT GGTACCATGC TGGCTTTTGG TAACATGGAT TTTCCTGCCC TGGTTAACCT GTATAATGTC AAAGGTGATA CCGGTACCCG TGTCTTTGAG CGTGAGACCT CGGAATATGT TACATCAAAA CATATCTTTG CCCAGGCTAT TCAGGTCAAA GACCGGATGA GCCTTGAAAA AATCTCACTT GCCATGCGAA AATTCGGTGG TGATGGTGCG GTGTACATAG ATCTGGTCAA GGATAAGGGC GGCAAACCAG ACATCATGCA GGGAATCCGC TCTAACCTGC TTGACCTGGA AAAGATAGTC AGAAGACCGG GTTATTACTG GGTAGATTTT TCATTTCCGC CTGATGGGAA CAAACCGCAA CAGCTTGCTC CGGGCAAGTA CTGGATTGTA TTCCGTGCCT CAGGCGAGGC GATCATGAAC TGGTTTTTCA CACCGGGTAA ACCGTATGGC 
GAAGGTGATG ATACCCGTTC CACGGCAAAG GGCTTCCAGT GGGAAGATAT CCTGAACTAT GACTTTGTCT TTAAGGTCAG CGGCAAGGCG GTATGA
|
Protein sequence | MRRSLLTTCA IVACSVVMMS LVASAKTLIL EGNLDGAVAM RQSMEFSVNK GTVSSFSFKF ALPASFSTRT VSQGVDGLDI GLSPQPTSST VETDRFGNKF QRVSWNNVNQ DIRVNLNYTA NVRTQLTSLE SKTPFPLGSL ASRESLYLQH TEMVQGDAEI KSLARQLTSG VKNEYEAVSA IINWVTDNIK YTFNPPQYDA SYTLSTRSGN CQNFAHLSIA LLRSVGIPAR IVGGITLKEG WKVPIDAKNS IVQSMGQGGH AWLEVYFPDL GWLPYDPQQS RQFTSSRHIK QTHGLDSRDI NDTWKGGPYL PAYSELIEGR YTTDDIKLKM KRYASAPRPY ILSNAVMAGS AGVVDTADSG RETPPAAKPP RPESRTNVAS AKPATPPQGS GAGPAETRLV SDDEASDEDQ PRPVKPKPPV KPVKPPVTKP PKVATKPPVV VEPGHKKPRP GTMLAFGNMD FPALVNLYNV KGDTGTRVFE RETSEYVTSK HIFAQAIQVK DRMSLEKISL AMRKFGGDGA VYIDLVKDKG GKPDIMQGIR SNLLDLEKIV RRPGYYWVDF SFPPDGNKPQ QLAPGKYWIV FRASGEAIMN WFFTPGKPYG EGDDTRSTAK GFQWEDILNY DFVFKVSGKA V
|
| |
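A GC content figure like the one in the table can be recomputed from a gene sequence written in this space-separated 10-base block layout. This is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper, and it assumes that only unambiguous A, C, G and T characters should be counted.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace and any other characters (for example the spaces between
    10-base blocks) are ignored. Counting only unambiguous bases is an
    assumption of this sketch.
    """
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T characters")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence listed above.
    print(gc_percent("ATGCGTAGAT CGCTACTGAC"))
```

Passing the full gene sequence instead of the two-block excerpt gives the value to compare against the GC content field.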