Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Glov_3698 |
Symbol | |
ID | 6369562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter lovleyi SZ |
Kingdom | Bacteria |
Replicon accession | NC_010815 |
Strand | + |
Start bp | 52063 |
End bp | 54987 |
Gene Length | 2925 bp |
Protein Length | 974 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642679112 |
Product | transposase Tn3 family protein |
Protein accession | YP_001953917 |
Protein GI | 189426741 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0861191 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.131988 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAG TTTGGGATAC TGATGAATTG GCTACACATT GGTGCTTAAC GTTTGAAGAT CACCACCTTT TAAAAAACAA ATTGCTCAAA AACCATCTTG GGTTTTCGAT CCAGCTTAAG CATTTTCAGT ATAGCGGAAA ATTTTTACAT ACCCATTCTG ATATCTCAAC CCCTCCCCTT GAGCATGTGG CTGAACAGCT CAATGTTTCA GTTTCTGATT TTGAATCATA CGATTTCAAT AGTCGCACCG GGCGACGGCA TTGCATTGAA ATCCTGAACT ATCTCGGAAT CCAGCGCCTG ACCTCGACAG ACAAGGACGC TTTTTCTGAT TGGCTTCGAC AAGAAATTTT CCACCAAGGG ACGACTATTT CTGAGGCCAT TGAGTTGGCT TATGATTGGT TCAAAAAACA GAAAGTTGAA TACCCCACTG AGGCCGTATT GGAGCGCCTT GTCCGATCTG CCTTTTATCG CTACGAACAA GAATTTTTCA ACCAGATCAT CCGTGAACTA CGACCTTCAG CTAAAGACAA GATGGATCAT TGCCTTGAGA ATGTTGAAAG AGGTATCGAA TTTGGTCGTT TGAAAGCTGA TCCAGGCCGC GTCGGTCTGG AGAGTGTCTT GGCTGAAGTT GAAAAACTCC ATTTCATACA GTCTCTTGAC CTTCCACAGG GTCTCTTTCA AACCTGTAAC ATTAAAGCCT TAACGCACTA CTATCAACGG GTCAGCAGTG AAAGTGCCTG GAGGGTCAAG GAACACCCGC CTGAAATCAG ATATGCCCTG CTGGGGGTTT TTCTCTTTTT CCGGCAGCGA GAAATCATAG ATGGTCTGAT TGAACTCTTC ATCCAGATTG TTCACCGCCT CACAGTTAAA GCTGAACGGA AATTGATCAA GGAGTTGCTA AGTGATTTCC GGAAAGTCCA TGGAAAGAGC ACCCTCCTGT TTAGAATTGC AGAGGCGGCA CTCTTAAATC CTGAGGGGCG GGTGAAGGAT GTCGTCTACC CTGTTGCTGG GGAGAATGTG CTGCAAAATC TGCTCAAAGA GTTCAAATCT TCCGGCCCTG GCTACAGACA GCAGGTTCAT AAAATCATTC GTTCCTCTTA TGGCAACCAT TATCGGCGCA TGGTTCCGAA AATCCTTGAA GCTCTTTCCT TCTGCTCAGG CAATACGCAG CATCGACCTG TGCTGGAAGC GCTGGAATGG ATTCAACATA ACCGTGACAA CTCTCAACGT TTTATCCCGC TGGACGAGGG TATTCCGATA GACGGTGTTA TTCGCAAGCA AGATCAAGAA GTGGTACTCG AAGAGGATGC TCAAGGTAGA GAGCGAATCA ACCGTATCAA CTATGAAATC TGTGTCCTCC AGGCTTTGCG TGACAAACTT CGCTGTAAAG AAATCTGGGT GATGGGCGCT GACAAGTTTC GTAACCCAGA CGAGGATCTA CCAGCCGACT TTGAGGACAA ACGCGAGGAT TATTACCTCG ATCTTGGCCA TTCGACAGAC AGCTCAGAGT TCATCGAGAA AATTCAGGAA CGGATGCGCT CGGCTCTTAC CGAGCTTGAC GAGGGCATTC CGAGAAATCA GAAGGTTCGG CTTCTCAATA GAGGTAAGAA GAACATATCC ATCACGCCTT TCGAAGCCCA AGAAGAACCG CCAAGCCTTA CCGCGTTGAA ACGGGAAATC GCTGGTCGGT GGCCAATGAC CAGTTTGCTC GATGTGCTCA AAGAAGCGGA TTTGCGAGTT GGCTTTACCG ATCATTTCAA GACTGTTGCT GATCGAGAGA TTCTTGATCG GCAAAGTCTG CAACGGCGGT TGTTGCTCTG TCTTTATGGG ATGGGAACCA ATACTGGTCT AAAGCGGGTC AGCGGCAACC GTCACGGAAT CAGCTATAAG GAGCTGCTGC ATGTCAGGCG ACGCTACGTT CACAAGGCTG CCCTCCGGAA TGCTATCGGC CAAGTCGCCA ATGCCATTTT CAGCATACGC AACGCCGATG TCTGGGGAGA GGGTTCAACT TCTTGTGCTT CCGACTCAAA GAAATTCGGC TCTTGGGATC AGAACCTGAT GACGGAATGG CATATCCGCT ACGGTGGTCG CGGAGTCATG ATCTACTGGC ACGTCGAAAA GAAATCGACC TGCATCTATT CTCAATTGAA GCGCTGCTCT TCCTCGGAAG TTGCCGCCAT GATCGAGGGT GTCTTGCGGC ACTGTACAGA CATGACGATA GATCGGCAGT ATGTTGATAG TCACGGGCAA AGCGAGGTGG CTTTTGCTTT CTGCCATCTG CTTGGCTTTG ATCTCCTGCC GAGATTGAAG GCGATTGCCA CACAGAAACT CTACCGTCCT GACGGTGACG CAACCGAGGC CTTTCCGAAC CTGGAAAGGA TCATGACCCG CCCGATCAAT TGGGAGCTGA TCAGCCAGCA GTATGATGAG ATGGTCAAGT ACGCCACCGC ACTGAAACAG GGAACCGCCA ACCCGGAAGC CATACTACGA CGATTTACCC GCAACAATAT TCAGCATCCG ACCTACCGGG CGCTCTCCGA ACTCGGTAAA GCAGTGAAGA CGATATTCTT ATGTCGCTAT CTCGGCTCGG AAGCCTTGCG GCAGGAAATT AACGAAGGTT TGAATGTCGT TGAGAATTGG AACAGCGCCA ATTCTTTCAT CTTCTACGGT AAGGGCGGCG AAGTCGCCAC CAATCGTTTG GAGGACCAGG AGTTGTCAGT CTTGGCCCTG CACCTGCTGC AGATCTGCTT GGTCTATGTC AATACGCTTA TGATTCAGCA GGTTCTGACC GAACCGGCTT GGCATTCGCG GATGAAGCAA GAGGACTACC GTGCTCTGTC GCCACTGATT TACAACCACA TCAATCCATA CGGCATCTTT GAGCTGGATA TGGATTTGCG GTTGCCGATT GAGTTAGTTG CTTAG
|
Protein sequence | MKKVWDTDEL ATHWCLTFED HHLLKNKLLK NHLGFSIQLK HFQYSGKFLH THSDISTPPL EHVAEQLNVS VSDFESYDFN SRTGRRHCIE ILNYLGIQRL TSTDKDAFSD WLRQEIFHQG TTISEAIELA YDWFKKQKVE YPTEAVLERL VRSAFYRYEQ EFFNQIIREL RPSAKDKMDH CLENVERGIE FGRLKADPGR VGLESVLAEV EKLHFIQSLD LPQGLFQTCN IKALTHYYQR VSSESAWRVK EHPPEIRYAL LGVFLFFRQR EIIDGLIELF IQIVHRLTVK AERKLIKELL SDFRKVHGKS TLLFRIAEAA LLNPEGRVKD VVYPVAGENV LQNLLKEFKS SGPGYRQQVH KIIRSSYGNH YRRMVPKILE ALSFCSGNTQ HRPVLEALEW IQHNRDNSQR FIPLDEGIPI DGVIRKQDQE VVLEEDAQGR ERINRINYEI CVLQALRDKL RCKEIWVMGA DKFRNPDEDL PADFEDKRED YYLDLGHSTD SSEFIEKIQE RMRSALTELD EGIPRNQKVR LLNRGKKNIS ITPFEAQEEP PSLTALKREI AGRWPMTSLL DVLKEADLRV GFTDHFKTVA DREILDRQSL QRRLLLCLYG MGTNTGLKRV SGNRHGISYK ELLHVRRRYV HKAALRNAIG QVANAIFSIR NADVWGEGST SCASDSKKFG SWDQNLMTEW HIRYGGRGVM IYWHVEKKST CIYSQLKRCS SSEVAAMIEG VLRHCTDMTI DRQYVDSHGQ SEVAFAFCHL LGFDLLPRLK AIATQKLYRP DGDATEAFPN LERIMTRPIN WELISQQYDE MVKYATALKQ GTANPEAILR RFTRNNIQHP TYRALSELGK AVKTIFLCRY LGSEALRQEI NEGLNVVENW NSANSFIFYG KGGEVATNRL EDQELSVLAL HLLQICLVYV NTLMIQQVLT EPAWHSRMKQ EDYRALSPLI YNHINPYGIF ELDMDLRLPI ELVA
|
| |