Gene Acry_1105 details


Gene Information

Locus tag: Acry_1105
Symbol:
ID: 5160841
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidiphilium cryptum JF-5
Kingdom: Bacteria
Replicon accession: NC_009484
Strand:
Start bp: 1229654
End bp: 1232644
Gene Length: 2991 bp
Protein Length: 996 aa
Translation table: 11
GC content: 65%
IMG OID: 640553020
Product: transposase Tn3 family protein
Protein accession: YP_001234237
Protein GI: 148260110
COG category: [L] Replication, recombination and repair
COG ID: [COG4644] Transposase and inactivated derivatives, TnpA family
TIGRFAM ID:
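
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1229654, 1232644)
print(length)                     # 2991
print(protein_length_aa(length))  # 996
```

Under these assumptions, both results agree with the Gene Length and Protein Length values listed in the record.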


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGGTCATC GACGGATATT GACCGGGGAA CAGATCGCAC AGCTGTTCGA CCCGCCGACT 
GATCGACGCG GCATAATTCG TCACTACACG CTGTCTGTTA CTGATCTTGC AATGATCCGG
CGCGGTCGGG GCGATCACCA TCGTCTCGGT CTGGCTCTCA TGCTCTGTTA CCTGCGGTAT
CCCGGCCGGC CGCTCCATGC CGGCGAGATA CCCGATCCTG CCCTGGTCTC GTTCGTCGCC
ACACAGATCG ACGTCTTGCC GGATTCCCTC GGTGCTTATC TGAGGGTGGA TCAAAATCGA
CGCCGGCATT CAGCGGCTTT GCAGGATCGC CTCGGGTTGC GGCCATGGGG GCCGCGCGTG
GCGGCAGACC TTGCCGACTG GCTGCTGGCG CATGCACTGG AAACCGACCG GCTGGTCGAT
CTGGCGGCTT TGGTGCTGGA AGAGTGCCGC GGTCGCGGGG TTCTTCTACC TCGCCCGGCG
CAGCTCGAGC GGCTCTGCAT CGAGGTTCGA TACCGCGCGC GCCGGGAAAT CGAGCGTCTG
CTGACCGAGG GTCTGTCCGC CGATCAGCGG CGTCGGCTGG ATGCGTTGAC GGAGCGGCGT
CTGGACACCA GCCAGAGCTG GCTGGCCTGG CTGCGCCAGA TCCCGGAATC GGCAAAGCCG
GCATCCATGC TCGGGGTGAT CGAGCGGTTG GAGCACATCC GGGCAATCGG CATTGATCCG
GCGCGCGGTC GCCGGATCAA TCAATCCCGG CTCGCCCAGC TCGTTCGCGA GGCCGGCCGG
ACCACGGTGC AGCATGTTGC TGGCTACGAG CGCCGGCGCC GGCATGCCGT GCTCACCGCG
ATCGATCTCG ATCTGTCGGC CCGCCTGACG GATCAGGCGA TCACTCTGTT CGAACGGCTG
ATCGGCGCGA TGTTCCGCAA GGCGGAAGGC CGACATGCCC GGGCGTTCCA GGCCGAAGGC
CGGGCGATCA ACGAGAAGGT CCGGCTTTAT GCGAGGATCG GCGCCGCCCT GATGGCCGCG
CGCGCCGGCC AGGGCGATCC CTTCGCGGCG ATTGACAAGG TGATACCCTG GGACCGGTTC
TGCTCGACCG TTATGGAAGC GGAAACTCTG GTTCGTTCGG AAGATTTCGA TCCCTACGAG
GTGCTGAGCG AACATTATGC CGGCATTCGG CGCTGGGCGC CGGCGTTTCT GGCCACTTTC
GAGTTTCAGG GTGTTCCCGC CGCGGCGTCG TTGATGCGCG CGATCGCCAT GCTGCGCGCC
ATGAACAGCG CGGGAGCCTC GACGCTGCCC AAGTCAGCGC CGACCGCCTT TGTCCGGCCG
CGCTGGGCGC GTCATGTCCT CACGGCACAC GGCATCGACC GGCGTTATTA CGAACTCTGC
GTCCTGGCGG AATTGCGTCA GCGCCTTGCT GCCGGCGATG TCTGGGTTGC CGGCAGCCGG
CAATATCGGG CCTTCGAGGA GCGGCTGATC TCCAGGGAAA CATTTCAGGT CCTGCAAAAA
GAGTCCAGCA TTCCGGTCGC TGTCGAGACC GACTTCGAGC GCTTCATCGC GACCAGGCGG
ACCTGGCTGG ATGCGCGGCT GGCCGAGATC GACATCCGTG CCCGGGGCGG GCTGCTTCCC
GATGTGACGA TCGAGAAGGG CGTGCTGAGA ATCACACCGA TCGAAAAATC GACTCCGCCC
GAGGCCGAAG CGCTGGCAGC AAAGCTCTAT GCCGCGCTGC CCCGCATCCG GATCACTGAC
CTGCTGACGG AAGTGGCCGG CTGGACCGGC TTCCCGGATT GCTTCACCCA TCTGCGCACC
GGCGAAGCCG CCGCTGACCC CCGGGTTCTG ATGGCGGGGC TGCTGGCCGA TGGCCTCAAT
CTCGGCCTGA CCCGGATGGC CGAAGCTTGC AGCATCGCCA GTCTGGGCCA GCTCGCCTGG
ACCGCTGACT GGCACATCCG CGACGAGACC TACGCCCTGG CGCTGCGCCG CCTGGTCGAA
CACCAGAGCC GCGAGCCGCT CGCCGCTCTC TTCGGGTCGG GAACCGCCTC ATCCTCGGAC
GGGCAGTTCT TCCGCGCCGG CGGTTCTGGC CGTGATGCGA GCCGGATCAA TGCGCACTAT
GGCCCCGAAC CGGGTCTGAA ATTCTACACC CATCTCTCCG ATCGCTACGC GCCGTTCCAT
ACCAGGGTGA TCGCCGCAAC CGCCAGCGAA GCCCTGCATG TCCTCGACGG CCTGCTCGAT
CATCACGGCG ATGCGCCGCC ACGTCAGCAC CGGCACCATA CCGATGGCGG CGGGGTGTCG
GATCATGTCT TTGCCCTGTG CGCCCTACTT GGATACGTGT TTGCGCCGAG AATTCCCGAT
CTGAAAGACC GGCGTCTCTA CAGTTTTGCC AGACCGGCAG CCTATCCGAC GCTCGCGCCG
ATGATCGCCG GCCGCATCAA CGTCGATCTC ATCCGCACCC ATTGGCCCGA TCTCCTGAAG
ATCGCCACCT CGATCCGCAC CGGCACGGTG TCCGCGTCGG TGATCCTGCG GCAACTCGCC
GCCTACCCAC GACAGAACGC CGTCGCCGCG GCACTGCGCG AACTCGGCCG TCTCGAGCGG
ACGCTGTTCA CCCTCGACTG GCTGGAAGAT CCGGGCCTGC GCCGTGAAAG CAGTCATGAA
CTCAACAAGG GCGAGGCCCG CAACAGCCTG GCCCGCGCCG TCTTCATTCA CCGGCTCGGC
GAAATCCGTG ACCGGACCTT CGAGAACCAG ACACATCGAG CATCTGGCCT GAACCTCCTG
GTCACCGCTA TCATCCTCTG GAACACACGC TATCTCGCGC AAGCCATACA GGCCCTACGC
CAGGTCGAGG ATGTGCCCGG AACCCTCCTC AGGCATCTCT CGCCGATCGG CTGGGAGCAT
GTGAACCTGA CCGGCGACTA CATCTGGAGC GCCAAACAGA AATCGTCGGA AAACCATGCC
GGATTGCGGC CGCTCCGGCC AATCCACGAC ACGACCACCC ACGCAGCCTG A
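
The GC content reported for this gene can in principle be recomputed from the gene sequence listed above. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown here; `gc_content` is a hypothetical helper name, and any character other than A, C, G or T (such as spaces and line breaks) is simply ignored.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# Example on the first three 10-base blocks of the gene sequence only;
# the whole sequence would be needed to compare with the reported value.
fragment = "GTGGGTCATC GACGGATATT GACCGGGGAA"
print(f"{gc_content(fragment):.0%}")
```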
 
Protein sequence
MGHRRILTGE QIAQLFDPPT DRRGIIRHYT LSVTDLAMIR RGRGDHHRLG LALMLCYLRY 
PGRPLHAGEI PDPALVSFVA TQIDVLPDSL GAYLRVDQNR RRHSAALQDR LGLRPWGPRV
AADLADWLLA HALETDRLVD LAALVLEECR GRGVLLPRPA QLERLCIEVR YRARREIERL
LTEGLSADQR RRLDALTERR LDTSQSWLAW LRQIPESAKP ASMLGVIERL EHIRAIGIDP
ARGRRINQSR LAQLVREAGR TTVQHVAGYE RRRRHAVLTA IDLDLSARLT DQAITLFERL
IGAMFRKAEG RHARAFQAEG RAINEKVRLY ARIGAALMAA RAGQGDPFAA IDKVIPWDRF
CSTVMEAETL VRSEDFDPYE VLSEHYAGIR RWAPAFLATF EFQGVPAAAS LMRAIAMLRA
MNSAGASTLP KSAPTAFVRP RWARHVLTAH GIDRRYYELC VLAELRQRLA AGDVWVAGSR
QYRAFEERLI SRETFQVLQK ESSIPVAVET DFERFIATRR TWLDARLAEI DIRARGGLLP
DVTIEKGVLR ITPIEKSTPP EAEALAAKLY AALPRIRITD LLTEVAGWTG FPDCFTHLRT
GEAAADPRVL MAGLLADGLN LGLTRMAEAC SIASLGQLAW TADWHIRDET YALALRRLVE
HQSREPLAAL FGSGTASSSD GQFFRAGGSG RDASRINAHY GPEPGLKFYT HLSDRYAPFH
TRVIAATASE ALHVLDGLLD HHGDAPPRQH RHHTDGGGVS DHVFALCALL GYVFAPRIPD
LKDRRLYSFA RPAAYPTLAP MIAGRINVDL IRTHWPDLLK IATSIRTGTV SASVILRQLA
AYPRQNAVAA ALRELGRLER TLFTLDWLED PGLRRESSHE LNKGEARNSL ARAVFIHRLG
EIRDRTFENQ THRASGLNLL VTAIILWNTR YLAQAIQALR QVEDVPGTLL RHLSPIGWEH
VNLTGDYIWS AKQKSSENHA GLRPLRPIHD TTTHAA
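
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes translation table 11, which uses the standard amino acid assignments but accepts alternative start codons such as GTG; an initiating GTG is read as methionine (M) rather than valine. The set of start codons below covers only the three most common initiators and is an assumption for illustration, as are all names in the code.

```python
BASES = "TCAG"
# Standard amino acid assignments in TCAG order (shared by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only the most common bacterial initiators are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an alternative start codon as M."""
    dna = "".join(seq.upper().split())  # drop spaces and line breaks
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate_cds("GTGGGTCATC GACGGATATT GACCGGGGAA"))  # MGHRRILTGE
```

The output matches the first ten residues of the protein sequence above, which is consistent with the gene starting at a GTG codon.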