Gene Information |
Locus tag | Aave_2941 |
Symbol | |
ID | 4666874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidovorax citrulli AAC00-1 |
Kingdom | Bacteria |
Replicon accession | NC_008752 |
Strand | - |
Start bp | 3230650 |
End bp | 3233625 |
Gene Length | 2976 bp |
Protein Length | 991 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639824140 |
Product | transposase Tn3 family protein |
Protein accession | YP_971282 |
Protein GI | 120611604 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
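
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(3230650, 3233625)
print(length_bp)                  # 2976, matches "Gene Length"
print(protein_length(length_bp))  # 991, matches "Protein Length"
```

The strand does not affect this calculation: for a gene on the minus strand the record still lists the smaller coordinate as Start bp.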
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.360292 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGTCA GCTTCCTGTC CACCACGCAA CGGGAACGCT ACGGCCGCTA TCCAGAGGCG CTTTCCAGCG AGGAACTGGG GCGTTACTTC CACCTGGACG ACGACGACCG CGAGTGGATC GCCACCAAGC GGCGCGACAG CAGCCGCCTC GGTTACGCAC TGCAACTGAC GACGGCGCGG TTTCTCGGCA CCTTTCTGGA AGACCCTACC GCCGTGCCAA GCCCGGTGCT GCATACGCTG TCGTCGCAAC TTGGCATCGC CGACCCTTCC GATTGTGTTA TCGACTACCG GACGACCCGG CAGCGCTGGC AGCACACGAC CGAGATTCGC GCTCGCTACG GCTACCGCGA ATTCGCCGAA CGTGGCGTGC AGTTCCGCCT TGGCCGCTGG CTGTGCGCGC TGTGCTGGAC GGGCACCGAC CGTCCGAGTG CGCTGTTCGA CTACGCCAAC GGTTGGCTGG TCGGCCACAA GGTACTGCTG CCCGGCGTCA CGGTGCTGGA ACGCTTTATC GCCGATATAC GCTCGCGCAT GGAGTCGCGC CTGTGGCGTT TGCTGGTGCG CGGCGTGACG GTCGCACAGC GGCAGCGTCT CGAAGACTTG CTCAAGCCTG CCGAAGGCAG CCGCCAGTCC TGGCTGGATC GGCTGCGCAA GGGGCCGGTG CGCGTCAGCG CTCCGGCGCT TGTGATGGCC TTGCTGCGCA TCGAAACCGT GCGGGATCTG GGCATCAAAC TGCCCGGCAC CCATGTGCCA CCAAGCCGGA TCGCGGCACT CGCCCGCTTT GCCAGTACGG TCAAGGTATC CGCCGTGGCC AGGCTGCCGG AGGCGCGGCG CATCGCCACG CTGGTCGCCT TCGTGCATTG CCTGGAAGCC AGCGCTCAGG ACGATGCCCT TGATGTGCTC GACCTGCTGC TGCGCGAACT GTTCACCAAG GCTGAGAAGG AAGACCGCAA GTTCAGGCAG CGCTCCCTCA AAGATCTGGA TCGGGCTGCC TCGACGCTGG CTGAGGCGTG CCGGATGCTG CTCGATCCCG GTTTGCCGGA CGGCGAACTA CGCGAGCGTG TCTATGCCGC CATCGGCCGC GATGAACTGG CCCAGGCGCT CAACGAAGTT CGCGGCCTGG TGCGCCCGCC CAACGATGTG TTCTACACCG AACTGGAAGC CAGGAAGGCC ACCGTCTCGC GCTTCCTGCC GACATTGCTG CGCGTCATCC GCTTCGACGC CAATCCAGCC GCGCAGCCTT TGGCGCAGGC GTTGAAATGG CTGCATGAGA AGCCCGACCA TGATCCGCCC ACGGCCATCG TCGGCAAAGC GTGGCAACGC CATGTCGTGC AGGAGGACGG CCGGATCAAT GCCACGGCCT ATTCTTTCTG CGCGCTCGAC AAGCTGCGCA GCGCGATCCG CCGCCGCGAC ATGTTCATCA GCCCGAGCTG GCGTTACGCC GATCCGCGTG CCGGACTGCT GGCAGGAGCC GAGTGGGAGG CCGCACGGCC CATCGTCTGC CGCTCGCTGA GCCTGACGGC GCAACCGGAA GCAACGCTGG CGGCACTCAC GCGCGAACTG GACAAAACCT ACCGGCGCGT CGCCGCTCGC CTGCCCGAGA ACGACGCGGT GCGCTTCGAG ACGGTCGGCG ACAAGACCGA ACTGGTGCTC AGCCCCTTGG AAGCGTTGGA AGAACCAACT TCGCTGATCG CGCTGCGCAA CGAAATCAAG GCGCGCATGC CGCGCGTCGA TCTGCCGGAA ATCCTGCTGG AAGTCGCCGC GCGTACTGGC TGCATGGATG CCTTCACGCA CCTGACCGAG 
CGCACGGCGC GTGCGGCCGA CCTGACCACC AGCTTGTGCG CGGTGCTGAT GGCTGAAGCC TGCAACACCG GCCCGGAACC GCTGGTGCGG CAGGACACCC CGGCGCTCAA ACGCGACCGG CTGATGTGGG TCGATCAGAA CTATGTGCGT GATGACACGC TGGTTGCCTG CAACGCCGTG CTGGTGGCGG CGCAAAACCG CATCGCATTG GCGCGCACCT GGGGCGGCGG TGACGTGGCC TCCGCCGACG GCATGCGCTT TGTGGTGCCG GTACGGACCA TCCACGCCGC GCCGAACCCG AAATACTTCA ATCGCGGGCG TGGCGTCACC TGGTACAACC TGCTGTCCGA TCAATGTACT GGGCTGAACG CGATCACCGT GCCCGGCACG CTGCGCGACA GCCTGGTCTT GCTGGCGGTC GTGCTGGAGC AGCAGACCGA GTTGCAGCCG ACACAGATCA TGACCGACAC CGGTGCGTAC AGCGATTTGG TGTTTGGCCT GTTCAGGCTC TCCAACTACC GCTTCTGCCC GCGCCTGGCC GATGTCGGCG GCACACGCTT CTGGCGTGTC GATCCCGACG CTGACTATGG CGAGCTCAAC GCGCTCGCCC GGCAGCGTGT GAACCTCGAC CGCATCACGC CGCATTGGGA TGACGTGCTG CGCCTGGTCG GCTCGCTCAA GCTCGGCCTG GTACCGGCGA TGGGCATCAT GCGCACCTTG CAGGTCGATG AACGGCCGAC CAGCCTAGCG CAGGCCATCG CCGAAATCGG TCGCATCGAC AAGACCATCC ACACGCTGAA CTTCATCGAC GACGAGGCCC GCCGCCGCGC CACGCTTCTG CAATTGAACC TCGGCGAAGG CCGCCACAGT TTGGCGCGCG AGGTTTTTCA CGGCAAGCGC GGCGAACTGT TCCAGCGCTA CCGCGAAGGA CAGGAAGACC AGTTGAGCGC GCTCGGCCTG GTTGTGAACA TGATCGTGCT GTGGAACACG CTGTACATGG ACGCGGTACT GGCGCAGTTG CGCAGCGAGG GCTACCCGAT CCGCCCCGAA GACGAGGCGC GGTTGTCGGC GTTCGTCCAC GAGCACATCA ATATGCTCGG ACGCTACTCG TTCTCGGTGC CGGAAGCAGT CGCGCGTGGC GAACTGAGAC CGTTGACCAA ACAAAATGAA CCTTAA
|
Protein sequence | MPVSFLSTTQ RERYGRYPEA LSSEELGRYF HLDDDDREWI ATKRRDSSRL GYALQLTTAR FLGTFLEDPT AVPSPVLHTL SSQLGIADPS DCVIDYRTTR QRWQHTTEIR ARYGYREFAE RGVQFRLGRW LCALCWTGTD RPSALFDYAN GWLVGHKVLL PGVTVLERFI ADIRSRMESR LWRLLVRGVT VAQRQRLEDL LKPAEGSRQS WLDRLRKGPV RVSAPALVMA LLRIETVRDL GIKLPGTHVP PSRIAALARF ASTVKVSAVA RLPEARRIAT LVAFVHCLEA SAQDDALDVL DLLLRELFTK AEKEDRKFRQ RSLKDLDRAA STLAEACRML LDPGLPDGEL RERVYAAIGR DELAQALNEV RGLVRPPNDV FYTELEARKA TVSRFLPTLL RVIRFDANPA AQPLAQALKW LHEKPDHDPP TAIVGKAWQR HVVQEDGRIN ATAYSFCALD KLRSAIRRRD MFISPSWRYA DPRAGLLAGA EWEAARPIVC RSLSLTAQPE ATLAALTREL DKTYRRVAAR LPENDAVRFE TVGDKTELVL SPLEALEEPT SLIALRNEIK ARMPRVDLPE ILLEVAARTG CMDAFTHLTE RTARAADLTT SLCAVLMAEA CNTGPEPLVR QDTPALKRDR LMWVDQNYVR DDTLVACNAV LVAAQNRIAL ARTWGGGDVA SADGMRFVVP VRTIHAAPNP KYFNRGRGVT WYNLLSDQCT GLNAITVPGT LRDSLVLLAV VLEQQTELQP TQIMTDTGAY SDLVFGLFRL SNYRFCPRLA DVGGTRFWRV DPDADYGELN ALARQRVNLD RITPHWDDVL RLVGSLKLGL VPAMGIMRTL QVDERPTSLA QAIAEIGRID KTIHTLNFID DEARRRATLL QLNLGEGRHS LAREVFHGKR GELFQRYREG QEDQLSALGL VVNMIVLWNT LYMDAVLAQL RSEGYPIRPE DEARLSAFVH EHINMLGRYS FSVPEAVARG ELRPLTKQNE P
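
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be joined, checked for GC content, and translated. It assumes the gene sequence is given in coding orientation (its first codons do translate to the start of the listed protein) and relies on translation table 11 sharing the standard code's amino-acid assignments; alternative start codons are not handled. All function names are hypothetical.

```python
# Standard codon table in NCBI base order (T, C, A, G). Translation table 11
# uses the same amino-acid assignments and differs only in permitted starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGCCGGTCA GCTTCCTGTC CACCACGCAA"
print(translate(first_blocks))  # MPVSFLSTTQ
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be cross-checked in the same way.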
|