Gene Information |
Locus tag | Acry_3568 |
Symbol | |
ID | 5159367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009470 |
Strand | - |
Start bp | 16679 |
End bp | 19669 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640538875 |
Product | transposase Tn3 family protein |
Protein accession | YP_001220308 |
Protein GI | 148244071 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
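The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that Gene Length includes the stop codon; the record does not state either convention. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 16679
END_BP = 19669


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by a CDS of gene_len bp, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 2991
print(protein_length(length))  # 996
```

Under those assumptions the result agrees with the listed Gene Length (2991 bp) and Protein Length (996 aa).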
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGGGTCATC GACGGATATT GACCGGGGAA CAGATCGCAC AGCTGTTCGA CCCGCTGACT GATCGACGCG GCATAATTCG TCACTACACG CTGTCTGCTG CGGATCTTGC AATGATCCGG CGCGGGCGGG GCGATCATCA CCGTCTCGGT CTGGCTCTCA TGCTCTGTTA CCTGCGGTAT CCCGGCCGGC CGCTCCATGC CGGCGAGATA CCCGATCCTG CCCTGGTCTC GTTCGTCGCC ACACAGATCG ACGTCTTGCC GGATTCCCTC GGTGCTTATC TGAGGGTGGA TCAAAATCGA CGCCGGCATT CGGCGGCATT GCAGGATCGC CTCGGGTTGC GGCCATGGGG GCCGCGCGTG GCGGCAGACC TTGCCGACTG GCTGCTGGCG CATGCACTGG AAGCCGACCG GCTGGTCGAT CTGGCGGCTT TGGTGCTGGA AGAGTGCCGC GGTCGCGGGA TTCTTCTACC GCCCCCGGCG CGCCTCGAGC GGCTCTGCAT CGAGGTTCGA TACCGCGCGC GCCGGGAAAT CGAGCGTCGG CTGACCGCGG GTCTGTCCGC CGATCAGCGG CGTCGGCTGG ATGCGTTGAC GGAGCGGCGC CTGGACACCA GCCAGAGCTG GCTGGCCTGG CTTCGCCAGA TCCCGGAATC GGCAAAGCCG GCGTCCATGC TCGGGGTGAT CGAGCGGTTG GAGCACATCC GGGCAATCGA TATCGATCCG GCGCGCGGTC ACCGGATCAA TCAATCCCGG CTCGCCCAGC TTGTTCGCGA GGCCGGCCGG ACCACGGTGC AGCATGTTGC CGGCTACGAG CGCCGGCGCC GGCATGCCGT GCTCACCGCG ATCGATCTCG ATCTGTCGGC CCGCCTGACG GATCAGGCGA TCACTCTGTT CGAGCGGCTG ATCGGCGCGA TGTTCCGCAA GGCGGAAGGC CGACATGCCC GGGCGTTCCA GGCCGAAGGC CGGGCGATCA ACGAGAAGGT CCGGCTTTAT GCGAGGATTG GCGCCGCCCT GATGGCCGCG CGCGCCGGCC AGGGCGATCC CTTCGCGGCG ATTGACAAGG TGATACCCTG GGACCGGTTC TGCTCGACCG TTATGGAAGC GGAAACTCTG GTTCGTTCGG AAGATTTCGA TCCCTACGAG GTGCTGAGCG AACATTATGC CGGCATTCGG CGCTGGGCGC CGGCGTTTCT GGCCACTTTC GAGTTTCAGG GTGTTCCCGC CGCGGCGTCG TTGATGCGCG CGATCGCCAT GCTGCGCGCC ATGAACAGCG CGGGAGCCTC GACGCTGCCC AAGTCAGCGC CGACCGCCTT TGTCCGGCCG CGCTGGGCGC GGCATGTCCT CACGGCACAC GGCATCGACC GGCGTTATTA CGAACTCTGC GTCCTGGCGG AATTGCGCCA GCGCCTTGCT GCCGGCGATG TCTGGGTTAC CGGCAGCCGG CAATACCGGG CCTTCGAGGA GCGGTTGATC TCCAGGGAGA CGCTGCAGGT CCTGCAGAAA GCGTCCGGCA TTCCGGTTGC CATCGAGACT GACTTCGATC GCTTTATCGA GACCAGGCGG ACCTGGCTGG ATGCGCGGCT GGCCGAGGTC GACGCCCGTG CCCGGGGCGG GCTGCTTCCC GATGTGACGA TCGAGAAGGG CGTGCTGAGA ATCACGCCGA TCGAAAAATC GACTCCGCCC GAGGCCGAAG CGCTGGCAGC GAAGCTCTAT GCCACGCTGC CCCGCATCCG GATCACCGAC CTGCTGACGG AAGTGGCCGG CTGGACCGGC TTCCTGGACT GCTTCACCCA TTTGCGCACC 
GGCGAAGCCG CCGCTGATCC CCGGGTGCTG ATGGCGGGGC TGCTGGCCGA TGGCCTCAAT CTCGGCCTGA CCCGGATGGC CGAAGCCTGC AGCATCGCCA GTCTGGGCCA GCTCGCCTGG ACCGCTGACT GGCACATCCG CGACGAGACC TACGCCCTGG CGCTGCGCCG CCTGGTCGAA CACCAGAGCC GCGAGCCGCT CGCCGCCCTC TTCGGGTCGG GAACCGCCTC ATCCTCGGAC GGGCAGTTCT TCCGCGCCGG CGGTTCTGGC CGTGATGCGA GCCGGATCAA TGCGCACTAC GGCCCCGAAC CGGGTCTGAA ATTCTACACC CATCTCTCCG ATCGCTACGC GCCGTTCCAT ACCAGGGTGA TCGCCGCAAC CGCCAGCGAA GCCCTGCATG TCCTCGACGG CCTGCTTGAT CATCACGGCG ATGCGCCGCC ACGTCAGCAC CGGCACCATA CCGATGGCGG CGGGGTGTCG GATCATGTCT TTGCCCTGTG CGCCCTGCTC GGATACGTAT TCGCGCCGAG AATTCCTGAC CTGAAAGACC GGCGTCTCTA CAGTTTTGCC AGACCGGCAG CCTATCCGAC GCTCGCGCCG ATGATCGCCG GCCGCATCAA CGTCGATCTC ATCCGCGCCC ATTGGCCCGA TCTCCTGAGG ATCGCCACCT CGATCCGCAC CGGCACGGTG TCCGCGTCGG TGATCCTGCG ACAACTCGCC GCCTACCCAC GACAGAACGC CGTTGCCGCG GCACTGCGCG AACTCGGCCG TCTCGAGCGG ACGCTGTTCA CCCTCGACTG GCTGGAAGAT CCGGGCCTGC GCCGTGAAAG CAGTCATGAA CTCAACAAGG GCGAGGCCCG CAACAGCCTG GCCCGCGCCG TCTTCATTCA CCGGCTCGGC GAAATCCGCG ACCGGACCTT CGAGAACCAG ACACATCGAG CCTCTGGCCT GAATCTCCTG GTCACCGCCA TCATCCTCTG GAACACACGC TATCTCGCGC AAGCCATACA GGCCCTACGC CAGGTCGAGG ATGTGCCCGG AACCCTCCTC AGACATCTCT CGCCGATCGG CTGGGAGCAT GTGAACCTGA CCGGCGACTA CATCTGGAGC GCCAATCAGA AATCGACGGA AAACCATGCC GGATTGCGGC CGCTCCGGCC AATCCCCGAC ACGACCACCC ACGCAGCCTG A
Protein sequence | MGHRRILTGE QIAQLFDPLT DRRGIIRHYT LSAADLAMIR RGRGDHHRLG LALMLCYLRY PGRPLHAGEI PDPALVSFVA TQIDVLPDSL GAYLRVDQNR RRHSAALQDR LGLRPWGPRV AADLADWLLA HALEADRLVD LAALVLEECR GRGILLPPPA RLERLCIEVR YRARREIERR LTAGLSADQR RRLDALTERR LDTSQSWLAW LRQIPESAKP ASMLGVIERL EHIRAIDIDP ARGHRINQSR LAQLVREAGR TTVQHVAGYE RRRRHAVLTA IDLDLSARLT DQAITLFERL IGAMFRKAEG RHARAFQAEG RAINEKVRLY ARIGAALMAA RAGQGDPFAA IDKVIPWDRF CSTVMEAETL VRSEDFDPYE VLSEHYAGIR RWAPAFLATF EFQGVPAAAS LMRAIAMLRA MNSAGASTLP KSAPTAFVRP RWARHVLTAH GIDRRYYELC VLAELRQRLA AGDVWVTGSR QYRAFEERLI SRETLQVLQK ASGIPVAIET DFDRFIETRR TWLDARLAEV DARARGGLLP DVTIEKGVLR ITPIEKSTPP EAEALAAKLY ATLPRIRITD LLTEVAGWTG FLDCFTHLRT GEAAADPRVL MAGLLADGLN LGLTRMAEAC SIASLGQLAW TADWHIRDET YALALRRLVE HQSREPLAAL FGSGTASSSD GQFFRAGGSG RDASRINAHY GPEPGLKFYT HLSDRYAPFH TRVIAATASE ALHVLDGLLD HHGDAPPRQH RHHTDGGGVS DHVFALCALL GYVFAPRIPD LKDRRLYSFA RPAAYPTLAP MIAGRINVDL IRAHWPDLLR IATSIRTGTV SASVILRQLA AYPRQNAVAA ALRELGRLER TLFTLDWLED PGLRRESSHE LNKGEARNSL ARAVFIHRLG EIRDRTFENQ THRASGLNLL VTAIILWNTR YLAQAIQALR QVEDVPGTLL RHLSPIGWEH VNLTGDYIWS ANQKSTENHA GLRPLRPIPD TTTHAA
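The gene sequence begins with GTG, while the protein sequence begins with M. This fits translation table 11 (bacterial, archaeal and plant plastid code), where GTG can serve as an alternative start codon and is then read as methionine. The sketch below is a minimal illustration, not the database's own pipeline: it builds the codon table from the usual TCAG-ordered amino-acid string and treats only ATG, GTG and TTG as start codons, which is a deliberately reduced subset. The `translate` helper and the constant names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; the amino-acid assignments of table 11
# match those of the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a reduced subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds):
    """Translate a CDS, reading an initial start codon as M and stopping at '*'."""
    cds = cds.replace(" ", "").upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # the initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with the record's spacing kept.
first_codons = "GTGGGTCATC GACGGATATT GACCGGGGAA"
print(translate(first_codons))  # MGHRRILTGE
```

The output matches the first ten residues of the listed protein sequence. Translating the last nine bases of the gene (GCA GCC TGA) the same way gives AA followed by a stop, consistent with the protein ending in HAA.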