Gene Information |
Locus tag | GM21_0813 |
Symbol | |
ID | 8136129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 967474 |
End bp | 968757 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644868428 |
Product | integrase family protein |
Protein accession | YP_003020642 |
Protein GI | 253699453 |
COG category | |
COG ID | |
TIGRFAM ID | |
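The coordinate, length, and protein fields above can be cross-checked against each other. The sketch below is only illustrative: the function name `check_record` is hypothetical, and it assumes the start/end coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the length fields of a gene record (hypothetical helper)."""
    # Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
    span = end_bp - start_bp + 1
    codons, remainder = divmod(span, 3)
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "whole_number_of_codons": remainder == 0,
        # Assumption: the final codon is a stop codon and is not translated.
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the record above (GM21_0813).
checks = check_record(967474, 968757, 1284, 427)
print(checks)
```

Under those assumptions, 968757 - 967474 + 1 gives 1284 bp, which is 428 codons, or 427 amino acids plus one stop codon.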
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0000000000471518 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCGTTACT ACCTTCGTGA GGGGCGAGGC TTCGCCCTGC GGGTCATGCC GTCCGGTCTC AAGAGCTTCG TCTATATCTA CGAACTGAAC AAGCGGAAAG GGTACCTGCT GCTCGGCCAC TATCCAGGCT GCTCTCTCGC CGAAGCACGC ATCGCCTTCA ACCACGCTTT CAACCTGGTA AAGAGAGGGA TCGACCCGCT TGGTCAGAAG AAAACCGTGG CAGAGGAGCG CGACAGGGCG GTCAGGGAAG CCGCCCGGGA GGCGGAGGCC CGTGCCGTCG CGGCGGACCA CCTGGAGCGA CTTACTTTTG AGACCCTGCT GAAAGACGGG ATCCCGGACG ACTTCACCCC CAAAACGGTG GAACAGCTCG CCGCAGTCTG GATGATCAGG TACTCGAGGG CTAACCACAC GGAGCGCTGG CAGGGAAGCG AACTCTGCTC TCTCAGGCTG CACATCCTTC CTGCCCTGGG AAAGGACGAT ATAACCGCGG TACGGCGCAA GCACGCAGTG ACCTTCATCG AGCGGCTCGC CGCCAGATTC CCAGGTGCCG CCCGAAACGC CATGAAGCTA TGCCGGCAGA TGTTCAAGTA CGCTTACCGC CAGGAATGGG CGGAGATCCA GCCGTTCAAC GAGATAACGG AGTCGGTGCC GAAGATCGCC CCGCGGGCCG ACGAGCGCCA CCTGGACGAC GACGAGATCG TCAGGGCATG GCGCGAGATC AGCGACTCGG CGAGCTCCCT CTACGTCAAG CGGGCATTGA AGCTTATCCT CGTCACCGCA CAGCGGCCGG GGGAGGTCGC CCAGATGCAC CGCAACCAGA TCAAGGAGAG ATGGTGGACC ATCCCGGCGG AGGTGGCAAA GAATCGGAGG GATCATCGCG TGTACCTAAC GGACACGGCC CTCGAGCTGG TCGGGGACGG CGCCGGTTAC ATCTTCCCCT CTGAGAAGGG AAAAACCCCC TTCCTCTCCG CTAACAGCCT TTCCCAAGCG ATCAACCGCG GCTACCCGGC TGTGGAAGCC ACGAAGCTAG TGGGGAACCA GACTATCAAG GCCGCGAAGA ATTGCTATTT CGGCATGAAG CCCTGGTCGC CCCAGGACCT GCGCCGTACC GCCCGCACCA ACATGGCGCG GGTGGGAGTC ATCGACGAGA TCGCCGAGGA GGTGGTGAAC CACAAGAAAT CAGGGATCGT CGGGGTCTAC AACAAGTACC GCTACGATAA AGAGAAGGAA GTGGCCCTGA CCAAGTGGGA GCAGCTACTG ATCGAGATAC TGAAGGGGGG CTGA
|
Protein sequence | MRYYLREGRG FALRVMPSGL KSFVYIYELN KRKGYLLLGH YPGCSLAEAR IAFNHAFNLV KRGIDPLGQK KTVAEERDRA VREAAREAEA RAVAADHLER LTFETLLKDG IPDDFTPKTV EQLAAVWMIR YSRANHTERW QGSELCSLRL HILPALGKDD ITAVRRKHAV TFIERLAARF PGAARNAMKL CRQMFKYAYR QEWAEIQPFN EITESVPKIA PRADERHLDD DEIVRAWREI SDSASSLYVK RALKLILVTA QRPGEVAQMH RNQIKERWWT IPAEVAKNRR DHRVYLTDTA LELVGDGAGY IFPSEKGKTP FLSANSLSQA INRGYPAVEA TKLVGNQTIK AAKNCYFGMK PWSPQDLRRT ARTNMARVGV IDEIAEEVVN HKKSGIVGVY NKYRYDKEKE VALTKWEQLL IEILKGG
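The sequences above are printed in space-separated blocks of ten characters, and the gene begins with GTG rather than ATG. As a rough sketch of how the two fields relate, the code below strips the block spacing, translates the nucleotides, and computes GC content. The helper names are hypothetical. The amino-acid assignments are those of the standard genetic code, which translation table 11 shares. Treating ATG, GTG, and TTG as initiators that are read as methionine is a simplification assumed here, since table 11 allows further start codons.

```python
BASES = "TCAG"
# Standard genetic code in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only these common bacterial initiators are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text):
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
prefix = clean_sequence("GTGCGTTACT ACCTTCGTGA GGGGCGAGGC")
print(translate(prefix))
```

Applied to the first three blocks of the gene sequence, this yields the first ten residues of the protein sequence listed above, MRYYLREGRG.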