Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Xaut_4894 |
Symbol | |
ID | 5420545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xanthobacter autotrophicus Py2 |
Kingdom | Bacteria |
Replicon accession | NC_009717 |
Strand | + |
Start bp | 109794 |
End bp | 112793 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640873558 |
Product | transposase Tn3 family protein |
Protein accession | YP_001409338 |
Protein GI | 154243765 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAAAC GGAGGCTTCT CAAGGTCCAA GATCGACAGA GACTTTTCGA TATACCAACC GATGAGGACG GCCTCATCCG GCACTATTCG TTGTCGTCGG CTGACAGGCT TGAGATTGGA CTTTGCAGAC GAGAACACAA TCGGCTCGGA TTTGCCGTTC AGCTCTGCCT GATGCGATAT CCAGGCAGGG TGTTGGCGAC CGATGAAACT CCGCCTCGCG CAATGCTAGA GTACGTTGCT GAGCAGATTG GCGCCGACGC TGGAAAGTTT GCGCTCTATG CACGCCGTGA AGAAACGCGG CGCGATCACA TTGCTCGCTT GATGGTTTAT CTGGCCGCGC GGAGCGCGAC GGGGCAAGAC CGTAGGGCTG CGCTGTTGGC TGCAATTCAG GCGGCCACGA TGTCCGACGA CGGTGGCGCG ATAGCGAGTG CTACTGTCGC CATGTTTCGT GAACGCGGAT CTCTTCTGCC AGCAATCGAC ACGATCGAAC GGATCGGTCT TGCTGCCCGC GCCATTGCCC GTCGGCGGGC AGAGAGAGCG CTGATCGAAG AAATTTCGGT CGATACGCTT CAATCGTTGG ATAAGCTGTT GGAGGTTGAC CCGGCCATCG GCCAGACGCG ATTTCACTGG CTGCGATCAG CGCCGGATGC GCCAGGTACG TCAAACCTGG TCGGGCTGAC CGAACGGATT GCCTTCCTGC GCGAGCTAGA AATCGATCCG AGATTGCAGA TACGCATATC GTCTGGACGG TGGGATCAGA TGATCCGTGA AGGCAACGCC ACACCGGCAT GGCTGGCCAA CGACTTCAAT GCCAGCCGTC GACACGCGCT GATCGTGGCG CAGATTATCA AGCTCGGCCA GAAGCTCACG GACGATGCAG TGTCGATGTT CATCAAGCTG ATAGGTCGGC TGTTCTCGCA AGCCAATAAC CGCAAGAAGC AGCGGCACAT GGACTGCAGG CCGGATACCG CCAAAGCGCT ACGCATGTTC CTGGACACGA TCACAGCCCT GCAGTCCGCG AACGATTATG GCCGGAACGC ATTGGAGGTT CTCGATCAGG AAGTTGGATG GCACCGGTTG CTTCGGATGA AGCCTGAGCT TGAGTCGATG GTCGACGACA ACGAGGCATC GCCCTTGACC TTAGCGGTCG AGCAATATGC CACCGTCAAC AAGTATGCCG GTGCGTTTCT GCAAGCGTTC ACGTTCCGCT CAGCGCGCCG CCACGATCCC CTTCTTGCGG CGATTTTCCT GCTGAAGCGG CTCTATGCCG AGAAGCGGCG GACCCTTCCG GATCGCGTCC CGGTCACCCA CCTCAGCCAA GTTGATCGAC GGCTAATCCT CGGGCAGGAG AAGCCCGATC GCCGTCTCTA TGAGATTGCA ACCCTCGCGG CTTTGCGAGA CCGGCTTAGA TCTGCGGACA TTTGGGTCGA TGGCAGCCGA TCCTTCCGAC CGATCGACGA GCACCTGATG CCGCGGTCAA CGTTCACCAT CCTGAAAGAT GAAGATCGCC TCGGACTTGG TGTCCAAGAA GACGGCGCGG CGTGGCTTAC CGAAGCGCGG CAGATGCTCG ACTTCAACCT GAAGCGCCTG GCGTACAGGG CACGATCCGG GAGGCTCGAA GGTGTTCGCC TTGAAGCTGG TACCTTGATC GTCACGCCGA CCGCCGGCGA GGTTCCTGCT GCAGCGGAGG AACTGAACGC CGAGATCAGC GAGCTTTATC CGTTGGTCGA GGTGCCGGAC CTCCTGCGGG AAGTGCACGA ATGGACCGGC TTTGCGGATT GCTTCACGCA TGTTCGAACG GGTGACACTC CGAGGAATGT CTCGGCCATG CTGGCTGGCG TACTGGCCGA TGCGACCAAT CTCGGTCCAA AGCGAATGGC CAGCGCGTCC AAAGGCATCA GCGCTCACCA GATCAGTTGG ATGCGAGCCT TCCATGCCCG GTCAGAGACC TACCGCGCGG CCCAGGCCTG CGTGACGGAC GCACACACCC GCCATCCGCA TTCTTGCCTT TGGGGCAATG GCACGACGTC ATCATCCGAT GGCCAATTCT TCCGAGCAAG CGACCGAGCC GCAAAGCGCG GAGATATCAA TCTACATTAC GGCAGTGAGC CCGGATCGAA GTTCTACAGC CATCTGTCAG ATCAGTACGG CTACTTCAGC ATCTTGCCCA TCAGCCCGAC CGAAAGCGAG GCTGCCTATG TGCTCGACGG ACTATTCGAT CAGGACACAA TCCTCGAAAT ACAGGAGCAC TTCACCGACA CCGGCGGCGC GAGCGATCAC GTCTTTGGGC TATTCGCTCT GATCGGCAAG CGGTTCGCAC CACGACTGCG CAATCTCAAA GATCGGAAGT TCCACACGTT CGAGAAAGGC GATGCATACC CGGCGCTGTC GAACCACATC GGGGCGCCGA TCAACACCAC CCTGATCCTC GATCACTGGG ATGATCTGCT TCATCTCGCG GCATCGATCA CCACCCGTGC CGTTGTGCCC TCTACGATTT TGAAGAAGCT CTCGGCATCA CCGAAGGAAA GCCAGCTGGC CAAGGCTCTT CGGGAACTCG GCCGCATCGA GCGGTCGCTC TTCATGACCG AATGGTACTC GAACTCGACA TTGCGCCGGC GCTGCCAAGC CGGCCTCAAC AAGGGCGAGG CAGCGCACAA ACTCAAACGC GCAGTCTTCT TCCATGAGCG TGGCGAACTC CGCGACCGGT CGTTCGAAAG TCAGGCATTC CGCGCATCGG GCCTCAATCT TGTCGTCAGC GCGATCGTCC ACTGGAACAC GGTCTATCTC GACCGCGCGG TCAAAGAGCT CAAACGAGCG GGAAGGAACA TTCCAGAGTC CCTGTTGAGG CATATCTCGC CACTGAGTTG GGAGCATATC AACCTGACAG GCATCTACAC CTGGGACAGC GAGCAACATC TCCCGGAAGG CTTCAGATTG CTTCGCCTCC CGGCTGGGCT ACGGCGTGCC GCACAACGTT CCTGCTCCGT TCGACCTTAG
|
Protein sequence | MGKRRLLKVQ DRQRLFDIPT DEDGLIRHYS LSSADRLEIG LCRREHNRLG FAVQLCLMRY PGRVLATDET PPRAMLEYVA EQIGADAGKF ALYARREETR RDHIARLMVY LAARSATGQD RRAALLAAIQ AATMSDDGGA IASATVAMFR ERGSLLPAID TIERIGLAAR AIARRRAERA LIEEISVDTL QSLDKLLEVD PAIGQTRFHW LRSAPDAPGT SNLVGLTERI AFLRELEIDP RLQIRISSGR WDQMIREGNA TPAWLANDFN ASRRHALIVA QIIKLGQKLT DDAVSMFIKL IGRLFSQANN RKKQRHMDCR PDTAKALRMF LDTITALQSA NDYGRNALEV LDQEVGWHRL LRMKPELESM VDDNEASPLT LAVEQYATVN KYAGAFLQAF TFRSARRHDP LLAAIFLLKR LYAEKRRTLP DRVPVTHLSQ VDRRLILGQE KPDRRLYEIA TLAALRDRLR SADIWVDGSR SFRPIDEHLM PRSTFTILKD EDRLGLGVQE DGAAWLTEAR QMLDFNLKRL AYRARSGRLE GVRLEAGTLI VTPTAGEVPA AAEELNAEIS ELYPLVEVPD LLREVHEWTG FADCFTHVRT GDTPRNVSAM LAGVLADATN LGPKRMASAS KGISAHQISW MRAFHARSET YRAAQACVTD AHTRHPHSCL WGNGTTSSSD GQFFRASDRA AKRGDINLHY GSEPGSKFYS HLSDQYGYFS ILPISPTESE AAYVLDGLFD QDTILEIQEH FTDTGGASDH VFGLFALIGK RFAPRLRNLK DRKFHTFEKG DAYPALSNHI GAPINTTLIL DHWDDLLHLA ASITTRAVVP STILKKLSAS PKESQLAKAL RELGRIERSL FMTEWYSNST LRRRCQAGLN KGEAAHKLKR AVFFHERGEL RDRSFESQAF RASGLNLVVS AIVHWNTVYL DRAVKELKRA GRNIPESLLR HISPLSWEHI NLTGIYTWDS EQHLPEGFRL LRLPAGLRRA AQRSCSVRP
|
| |