Gene Information |
Locus tag | Maqu_3983 |
Symbol | |
ID | 4653464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinobacter aquaeolei VT8 |
Kingdom | Bacteria |
Replicon accession | NC_008739 |
Strand | + |
Start bp | 83215 |
End bp | 86181 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639809835 |
Product | transposase Tn3 family protein |
Protein accession | YP_957174 |
Protein GI | 120537117 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
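The coordinate and length fields in this table can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the terminal stop codon; the record does not state either convention, so both are assumptions. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 83215
END_BP = 86181


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Number of residues, assuming the last codon of the CDS is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2967 988
```

Under these assumptions the computed values agree with the Gene Length (2967 bp) and Protein Length (988 aa) fields.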
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.948733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCGTC GCTTGATCCT CTCGGCTACG GAGCGGGACA CCCTGCTTGC GTTGCCAGAA AGCCAGGATG ACCTGATCCG CTACTACACC TTCAACGACT CCGACCTGTC GCTGATCCGT CAGCGGCGCG GCGACGCCAA CCGGCTGGGC TTTGCGGTGC AGCTCAGCCT GTTGCGTTAC CCCGGCTATG CGTTGGGCAC CGACAGCGAG CTGCCAGAGC CAGTCATCCA GTGGGTGGCT AAGCAAGTTC AGGCCGATCC GGAGAGCTGG GCGAAGTACG GCGAGCGCGA CGTGACCCGT CGCGAGCATA CCCAGGAACT TCGCACCTAC CTGCAACTGG CCCCGTTCGG ACTGTCGGAC TTCCGCGCCC TGGTGCGCGA ATTGACCGAG CTGGCCCAGC AGACCGACAA GGGCTTGCTG CTGGCCGGTC AGGCGCTGGA GAGTTTGCGG CAGAAGCGGC GCATCCTGCC GGCGCTGAGC GTAATTGATC GAGCTTGTTC GGAGGCCATT GCGCGGGCCA ATCGGCGGGT CTATCGCGCC CTGGTCGAGC CGCTAACGGA CTCGCATCGG GCCAAGTTGG ACGAGCTGTT GAAGCTCAAG GCCGGCAGCA GCATCACCTG GTTGACCTGG CTGAGGCAGG CACCGCTTAA GCCGAACTCT CGGCACATGC TCGAACACAT CGAGCGGCTG AGGACATTTC AGCTGGTGGA TTTGCCCGAA GGCCTGGGCC GGCACATCCA CCAGAACCGC CTGCTCAAGC TGGCCCGCGA GGGCGGGCAG ATGACGCCCA AAGACCTCGG CAAGTTCGAG CCGCAGCGGC GCTACGCGAC CCTGGCCGCC GTGGTGCTGG AGAGCACCGC GACCGTGATT GATGAGTTGG TCGATCTGCA CGACCGCATC CTGGTCAAGC TGTTCAGCAG CGCGAAACAC AAGCATCAGC AGCAGTTCCA GAAGCAGGGC AAGGCGATCA ACGACAAGGT GCGCCTGTAC TCCAAGATCG GCCAGGCGCT TCTGGAGGCC AAGGAAACCG GCAGCGATCC CTATGCCGCC ATCGAGGCGG TGATTCCTTG GGACGAGTTC ACCGAGAGCG TCAGCGAGGC TGAGCTACTG GCCCGACCGG AGGGCTTCGA TCATCTGCAC CTAGTCGGCG AGAATTTCGC CACCCTGCGC CGCTACACGC CGGCCTTGCT GGAGGTGCTG GAACTGCGCG CCGCGCCGGC CGCGCAAGGG GTACTGGCCG CTGTGCAGAC CCTACGCGAA ATGAACGCCG ACAACCTGCG CAAGGTGCCA GCCGACGCAC CCACAGCCTT CATCAAGCCG CGCTGGAAGC CGCTGGTGAT CACCCCGGAA GGCCTCGACC GGCGTTTCTA TGAAATATGT GCGCTGTCCG AGCTGAAGAA CGCCCTGCGT TCCGGCGACA TCTGGGTCAA GGGCTCGCGG CAGTTTCGCG ACTTCGACGA CTACCTGCTG CCGGCAGAGA AGTTCGCCGC GCTTAAGCGC GAGCAGGCCC TGCCCCTGGC GATCAACCCG AGCAGCGACC AGTACCTGGA AGAGCGTTTA CAGCTGCTGG ACGAGCAGTT GGCCACCGTC ACCCGGCTGG CCAAGGACAA CGAGCTGCCC GATGCCATCC TCACCGAGTC CGGGCTGAAA ATCACCCCGC TGGATTCTGC GGTGCCCAAC ACCGCGCAGG CGCTGATCGA CCAGACCAGT CAGCTGTTGC CGCGCATCAA GATCACCGAA CTGCTGATGG ACGTGGACGA CTGGACGGGT TTCAGCCGCC ACTTCACCCA CCTGAAGGAC 
GGTGCCGAGG CCAAAGACCG GACATTGCTG CTGTCAGCAA TCCTGGGCGA TGCGATCAAC CTCGGGCTGA CCAAGATGGC CGAGTCGAGC CCCGGCCTGA CCTACGCCAA GCTGTCCTGG CTGCAAGCCT GGCACATCCG CGACGAAACC TACTCGGCGG CCCTGGCCGA GCTGGTCAAC CACCAGTACC GTCATACCTT CGCCGCTCAC TGGGGCGACG GTACTACCTC TTCTTCCGAT GGCCAGCGCT TCCGGGCGGG CGGCCGGGGC GAAAGCACCG GGCACGTCAA CCCGAAGTAC GGCAGCGAGC CGGGGCGGCT GTTCTACACC CATATCTCCG ACCAGTACGC GCCCTTCAGC ACCCGCGTGG TGAATGTCGG CGTGCGTGAC TCCACCTATG TGCTCGACGG CCTGCTGTAC CACGAGTCCG ACTTGCGGAT CGAGGAGCAC TACACCGACA CGGCCGGTTT CACCGATCAC GTCTTCGCCC TGATGCACCT GCTGGGCTTC CGCTTCGCAC CGCGCATCCG CGACCTCGGC GAAACCAAGC TGTATGTTCC GAATAGCGTC CAGGACTACC CGACATTGCG CCCAATGGTT GGTGGCACCC TGAACATCAA GCACGTCCGC GCCCATTGGG ACGACATCCT GCGCCTGGCC AGCTCGATCA AGCAGGGCAC GGTCACCGCC TCTCTGATGC TGCGCAAGCT CGGCAGCTAT CCGCGCCAGA ACGGCCTGGC CGTGGCCCTG CGCGAGCTGG GCCGGATCGA GCGCACGCTG TTCATCCTCG ACTGGCTGCA AAGCGTCGAG CTGCGCCGCC GTGTACATGC AGGGCTGAAC AAGGGTGAGG CGCGCAACTC CCTGGCCAGG GCGGTGTTCT TCAACCGCCT AGGCGAGATC AGGGATCGGA GTTTCGAACA GCAGCGCTAC CGGGCCAGTG GCCTCAACTT GGTGACCGCC GCCATCGTGC TGTGGAACAC GGTGTACCTG GAGCGCTCCA CCCAAGCAAT GGGCGAGGCT GGAAAGCAGG TGAATGGCGA GTTGCTGCAA TACCTGTCGC CGCTGGGCTG GGAGCACATC AACCTGACTG GCGATTACGT CTGGCGGCAG AGCCGCAGGC TGGAGGACGG GAAGTTCAGG CCGCTACGGT TGCCCGGAAA ACCTTAG
|
Protein sequence | MPRRLILSAT ERDTLLALPE SQDDLIRYYT FNDSDLSLIR QRRGDANRLG FAVQLSLLRY PGYALGTDSE LPEPVIQWVA KQVQADPESW AKYGERDVTR REHTQELRTY LQLAPFGLSD FRALVRELTE LAQQTDKGLL LAGQALESLR QKRRILPALS VIDRACSEAI ARANRRVYRA LVEPLTDSHR AKLDELLKLK AGSSITWLTW LRQAPLKPNS RHMLEHIERL RTFQLVDLPE GLGRHIHQNR LLKLAREGGQ MTPKDLGKFE PQRRYATLAA VVLESTATVI DELVDLHDRI LVKLFSSAKH KHQQQFQKQG KAINDKVRLY SKIGQALLEA KETGSDPYAA IEAVIPWDEF TESVSEAELL ARPEGFDHLH LVGENFATLR RYTPALLEVL ELRAAPAAQG VLAAVQTLRE MNADNLRKVP ADAPTAFIKP RWKPLVITPE GLDRRFYEIC ALSELKNALR SGDIWVKGSR QFRDFDDYLL PAEKFAALKR EQALPLAINP SSDQYLEERL QLLDEQLATV TRLAKDNELP DAILTESGLK ITPLDSAVPN TAQALIDQTS QLLPRIKITE LLMDVDDWTG FSRHFTHLKD GAEAKDRTLL LSAILGDAIN LGLTKMAESS PGLTYAKLSW LQAWHIRDET YSAALAELVN HQYRHTFAAH WGDGTTSSSD GQRFRAGGRG ESTGHVNPKY GSEPGRLFYT HISDQYAPFS TRVVNVGVRD STYVLDGLLY HESDLRIEEH YTDTAGFTDH VFALMHLLGF RFAPRIRDLG ETKLYVPNSV QDYPTLRPMV GGTLNIKHVR AHWDDILRLA SSIKQGTVTA SLMLRKLGSY PRQNGLAVAL RELGRIERTL FILDWLQSVE LRRRVHAGLN KGEARNSLAR AVFFNRLGEI RDRSFEQQRY RASGLNLVTA AIVLWNTVYL ERSTQAMGEA GKQVNGELLQ YLSPLGWEHI NLTGDYVWRQ SRRLEDGKFR PLRLPGKP
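The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs in which codons may act as starts; that does not matter here because the gene begins with ATG). The helper `translate` and the constant names are hypothetical, and only the first 30 and last 15 bases of the gene sequence are used rather than the full 2967 bp.

```python
from itertools import product

# Amino acids in the conventional TCAG codon ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 bases, copied from the Gene sequence field.
first_bases = "ATGCCGCGTC GCTTGATCCT CTCGGCTACG"
last_bases = "CCCGGAAAACCTTAG"

print(translate(first_bases))  # MPRRLILSAT
print(translate(last_bases))   # PGKP
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed above, with the final TAG acting as the stop codon.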