Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Glov_1759 |
Symbol | |
ID | 6367495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter lovleyi SZ |
Kingdom | Bacteria |
Replicon accession | NC_010814 |
Strand | + |
Start bp | 1871288 |
End bp | 1873732 |
Gene Length | 2445 bp |
Protein Length | 814 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642677164 |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_001951995 |
Protein GI | 189424818 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0611654 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCCTG AACGTCCGCT CGTAATACCA CTTGCCTTTC TTGCTGCCGG CAGCCTGGCA GAGTATCTGT TGGCTATTCC TTTTTCCAGA TTGATTCCAG TGGTTCTGTT AGTTTTGCTT GTTTTTGCTC TGCCGTTTAG GAAGCAGGTG CTGTTCAGCT GTCTGCTGGC ATTATTCTGG ATGAGCTGGG AGATGGCTGC CCTGGCTCCC AGGCTGGATA TCAGGAATGT CAGGACAGGA ATCTCAGGCT ATGAGGGAAA GCAGCTTCTG GTTGAGGGGA TAGTGGTGCG GCGTCCAGCC ATCCTGCCTG AAGGAGAACG GCTCGTGCTG CAGGTCGAGC GGGTCTTTGC AGGCAGCGCA GAGTCTCTCT CGAGCGGTTC CCTGTTGTTA ACCATTGCAA AGGGGCACGG GAGCTGGCTG ACCGGTGACC GGATACGTTG TCCGGTGAAA GTCAGGGTTC CACAACTGCT TGGGCTGCCC GGTGAGTTTG ATTACGGTCG GTATCTGACA TTGCGCGGCA TTGAAGCAAC TGGCTGGATA CCTGATGCTG AATCGGTTGT TCTGATGCGT GGAGCTGCCA GAGCGTCATG GCAGCGCAGC ATAGATAGTC TGGCCATGCG CAGCCAGGAA TTCATTCGCC AGTGCCTGCC CGACTCCGCT CAGCGTGGCG TCGTGCTTGC CCTGGCAACC GGCAACCAGC AGGAGGTGCC TCCAGATGTA GCTGCAGCCT ATACTCGGGC TGGTGTTACC CATATACTGT CCGTTTCCGG CTTCCATGTC GGGGTGGTGA CAGCGGTCTG GGTCATCATG CTTAGATGGC TGATGCTGCG GTGGGAATGG CTGGCTCTGC AGCTGGATCT CCGCAGAGCT GCCTTGTTGT CAACGCTGCC GGTTATGTTG CTGTATCTGG TCTTTACGGG GGGAGCGCCG GCCACTGCAA GGTCAGTATT TATGGTGGCG GCAGTGGTGT TGGCGGCCTG GAGTGAACGT GAAATAGATC TCCTGGATGC GTTGTTACTG GCAGCCTTTG TGCTGTTACT GCAAGATCCT GCTGTGCTGT TTAATCTCTC GTTCCAGCTT TCCTTCTTGT CTCTGTGGGG ATTGCTTGTC CTGACACCAC TTTTGACTAA ACCTGTTGAG CATCTGCTAA AGCAGGAATG GCAGCGTATG ACAGTACTGT TTTTTGCCGC CTCTCTGGCT GCTGTGCTTG CAACTATGGC ACCGGTGCTT GCATCATTTC ATCAGGTTTC ATTTACCGGA ATAGCAGCTA ATCTGGTGGT GGTGCCATTG CTCGGTTACG GTGCAACAGT TCTGGCAACA ATGGCAGTCC CTCTGTCATT TTTCATGCCG TCGTTTGCTG CCTTGGTGCT GATGTTTACA GGCTGGCTGG TGCAGTTGAG TAATACTTTT GTCCAGTGGA TCGCCCGGGT TCCGGTACTT CACAGTTTCA GTGCCGGCAG CACTGATGTG GTGATAACCA TTGCCTTGCT TGCTGTACTT GGCTTTGTAC ATGGACGCCG GACCAGGATG TATGCCGGCA GCCTGCTGCT GGTTGCCCTG GTGCTGGTGC ACCTCTGGCC TGGTCCTGCA CTGGATGGGA AGCTCAGGAT GACTTTTTTG AGTGTGGGGC AAGGGGATGC AACCTTGATT CAATTGCCTG ACGGGCGTAC CATGCTGGTG GATGGTGGTG GTTATCTGCG GGATACCGGC CGGGATTTCG GTGAACGATA CCTGGTACCC GCCTTGCACG GTCTGAGTGT TAAACAGATT GATATCATGG TGTTAAGCCA TCCACATCCT GATCATCTGG GAGGATTGCC TGCAGTTGCA GAACAGTTCA AGGTAGGGGA ATACTGGCAG GCAAAAGGCA GCGCTCAGGG GGCTGACTAT CAGCGTTTGA TAAATGCGTT GGCGCATCAG AAAACAACTA CGCGGATCTT GCAACAGGGG GATCGTCTTC AGGTTGGGGA GGGGGTGCTG GTTTCTGTTC TGTCATCACC GCAAGGGGAA CAGTCCCTGA AAACCGATAA CGACGATTCA CTGGTGCTGC AGCTTCAGCA GGCAGGTTTC AGTGCTTTGA TGATGGGCGA TGCCGGTTTT CCGGTTGAGG AGATGCTGCT GAGCCAAGGG ATAGGACCGG CAACCGTGCT CAAGGTGGGA CACCATGGCA GTAAAACTGC CACAGGTGAG AGCTTTCTGC GACGAATCAA GCCAAACGTT GCAGTGGTAT CGGTTGGTGC AGGCAACAGT TTTGGTCTGC CGGCGGACGA GACGCTGGAT AAAATCAGGC ACCAGGGGGC TGTTCTTTAT CGTACCGACC AGCAGGGTAC TATTCAGCTG CTCAGTGACG GACACACCTA TACGGTTGGG CCGCTTGTGA CGGAAAATGG ATTGGTGCGA GCAATCAGAC GTTTTGCTTT GACAGCCAGT AACCAGCTGC GATAA
|
Protein sequence | MIPERPLVIP LAFLAAGSLA EYLLAIPFSR LIPVVLLVLL VFALPFRKQV LFSCLLALFW MSWEMAALAP RLDIRNVRTG ISGYEGKQLL VEGIVVRRPA ILPEGERLVL QVERVFAGSA ESLSSGSLLL TIAKGHGSWL TGDRIRCPVK VRVPQLLGLP GEFDYGRYLT LRGIEATGWI PDAESVVLMR GAARASWQRS IDSLAMRSQE FIRQCLPDSA QRGVVLALAT GNQQEVPPDV AAAYTRAGVT HILSVSGFHV GVVTAVWVIM LRWLMLRWEW LALQLDLRRA ALLSTLPVML LYLVFTGGAP ATARSVFMVA AVVLAAWSER EIDLLDALLL AAFVLLLQDP AVLFNLSFQL SFLSLWGLLV LTPLLTKPVE HLLKQEWQRM TVLFFAASLA AVLATMAPVL ASFHQVSFTG IAANLVVVPL LGYGATVLAT MAVPLSFFMP SFAALVLMFT GWLVQLSNTF VQWIARVPVL HSFSAGSTDV VITIALLAVL GFVHGRRTRM YAGSLLLVAL VLVHLWPGPA LDGKLRMTFL SVGQGDATLI QLPDGRTMLV DGGGYLRDTG RDFGERYLVP ALHGLSVKQI DIMVLSHPHP DHLGGLPAVA EQFKVGEYWQ AKGSAQGADY QRLINALAHQ KTTTRILQQG DRLQVGEGVL VSVLSSPQGE QSLKTDNDDS LVLQLQQAGF SALMMGDAGF PVEEMLLSQG IGPATVLKVG HHGSKTATGE SFLRRIKPNV AVVSVGAGNS FGLPADETLD KIRHQGAVLY RTDQQGTIQL LSDGHTYTVG PLVTENGLVR AIRRFALTAS NQLR
|
| |