Gene Information |
Locus tag | Sbal223_4353 |
Symbol | |
ID | 7090140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011664 |
Strand | + |
Start bp | 21836 |
End bp | 24868 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643463225 |
Product | transposase Tn3 family protein |
Protein accession | YP_002360237 |
Protein GI | 217975567 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
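The coordinates and lengths in this table are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The variable names are illustrative only and not part of the database record.

```python
# Values copied from the Gene Information table.
start_bp = 21836
end_bp = 24868

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon, so it is not counted
# in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 3033 0 1010
```

Under these assumptions the result matches the listed Gene Length (3033 bp) and Protein Length (1010 aa).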
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCGATTG CTAGAAGCGA GCGACTAACA GTTCTGTCGG ATGCCGAGCA GGAGGCGCTA TATGGCTTGC CGGACTTTGA CGATGCGCAG CGGCTGGAAT ACCTGGCCTT GACCGAAAAT GAGCTGGCGC TCGCCAGCAG CCGACCAAGC ATCCATGCTC AGCTCTATTG CATCCTGCAG ATTGGTTACT TCAAAGCCAA GAACGATTTC TTCCGTTTCC ACTGGCGTGA CGTCGAGGAT GATTGCGCTT TCGTGCTCAG CCGATATTTT CAAGGTGAGG CATTCGAGGC CAAGCAGATC TCCAAACACG AGCACTACAG CCAGCGAGGT AAAATTGCCG CGCTATTTGG CTACCAGCCG TGGGCCTCCA GCTACCTGCC ATCACTTGAG CAACAGGCCT CACTGATCGT GCGGCGCGAC GTGACGCCCA GCTTTGTGGC CGCTGAGCTG ATCGTCTGGC TCAACGAACA CAAGATCATT CGTCCCGGCT ATACCACCCT GCAAGCGTTG GTCAGCGAAG TCCTGTCAGT CGAACGTCGA CGACTGGGAG CCCTGCTAGC ACAGGTGTTG GATGAACCGG CCAAGACTGC GCTGAGCCAG CTCCTGGTGC GCGACGATAC GTTGTCACAA CTGGCGGCAC TCAAGCAAGA CGCCAAGGAT TTCGGGTGGC GTCAGATGGC CAGAGAGCGT GAGAAACGTG CCCTGCTGGC GCCGTTGCAC GAGATTGCCA AGGGGTTGCT GCCCAAACTG GGGATCTCTC AACAGAACCT GTTGTATTAC GCGAGTCTGG CTAACTTCTA TACCGTCCAT GATTTGCGCA ACTTGAAGGC AGAACAGACC CAGCTCTACC TGTTGTGCTA TGCCTGGGTA CGCTATCGGC AGCTCAGCGA TAACCTGGTC GATGCGATGG CGTTCCACAT GGATCAACTT GATCACGAAA ACCGAGCAGG AGCCAGGGAC ACTTTCGCAG AGGAACAGGT CAAACGGCAT CAGGAAACGC CCCAGGTTGG TCGCTTGCTG TCCCTGTACG TGGACGACAG TGTTGCGGAT CCGACGCCAT TCGGCGAGGT ACGCCAACGA GCCTACAAGA TCATGGCCAA AGACGTGCTG CAAAACACGG CGCAGCGCAT GAGCGTCAAG CCACTGAATA AACTGACATT GCACTGGCAG GCGGTGGATG GCCAAGCCAA ACGTATCCGT CGTCATCTCA GACCTTTGTT TGTTGCGCTG CAGTTCGCCG CCACCGATCC GGACAGCCAG TGGCTGGCGG CACTGACCTG GGCCAAAAGC GTGTTTGCCA AGCAGCAGCG CCTGTCACAA CGACCACTCG CTGAATGCCC GGTAGCCACA CTCCCCAAAC GCTTGCGGCC CTACCTGCTG ACCTTTGATG CGGATGGTGA GCCGACAGGG CTACATGCTG ACCGCTACGA GTTCTGGTTG TACCACCAAG TCAAAAAACG CTTCCAGTCT GGCGAGCTCT ATCTCGATGA CAGTTTGCAG CACCGGCATT TCTCTGATGA GTTGGTGTCT ATGGAAGAGC ACGCCCAGGA GCTGGCACAG ATGGACATCC CGTTTCTGCG CCAACCGATT GAAACACAGC TCGATGTCTT GGCCACCGAG CTGCACAGGC AATGGCAGGC ATTCAACCGT GAACTGAAAC AGGGCAAGCT GACACACCTC GAATACGACA AAGATGCGCA AAAACTGACC TGGCGCAAGC CCAAGAGGGA AAACCAAAAG GCGCGGGAAC ATACGTTTTA CGAGCAATTA CCGTACTGCG ATGTCGCCGA TGTGTTCCGC 
TTCGTCAACA ACCAGTGTCA GTTCCTATCG GCACTGACTC CGCTGCAACC ACGTTACGCC AAGAAGGTGG CTGATACCGA CAGCCTGATT GCGGTCATCA TCGCCCAGGC CATGAACCAC GGTAACCTGG TCATGGCGCG TACCAGCGAT ATTCCATACC ACATCCTGGA GAGCGCATAT CAGCAGTACC TGCGGCAAGC ATCGCTACAC GAGGCCAATG ACTTCATCAG CAATGCCATT GCTGCGCTGC CTATCTTCCC GTATTACTCG TTCGACCTCG ATGTGCTGTA CGGCGCTGTC GATGGGCAGA AATTCAGCGT TGAGCGGCCA ACCGTGAAGG CGCGCTACTC GCGCAAGTAC TTCGGGCGCG GCAAGGGTGT GGTGGCCTAT ACGCTCTTGT GTAACCACAT CCCGTTGAAC GGCTACCTGA TTGGTGCCCA TGATTACGAG GCCCACCATG TGTTCGACAT CTGGTATCGC AATACCTCTG ACATTGTGCC AGAGGTCATT ACTGGCGACA TGCACAGTAT TAACAAGGCC AATTTCGCTA TTCTGCACTG GTTCGGGCGG CGTTTCGAAC CACGCTATAC CGACCTCAAC CACCAGTTGC AGGCGTTGTA TTGTGTGGGC GACCCGGCGC GATACGAGAA GTGCCTGATC CAGCCAGTTG GCCAAATAGA TAGGCAGTTG ATTGTCAGCG AAAAGGCGAA CATTGATCGG ATCATCGCCA CACTGGGTTT GAAGGAGATG ACGCAGGGAA CGTTGATCCG CAAACTGTGC ACCTACACCG CGCCAAACCC AACGCGGCGG GCCATCTTCG AGTTTGATAA GCTCGTGCGC AGCATCTACA CGCTACGCTA CCTGCGCGAC CCCCAATTGG AACGTAACGT ACATCGCTCA CAAAACCGGA TTGAGTCATA CCATCAGCTA CGTTCAAGTA TTGCGCAGGT GGGGGGCAAG AAAGAATTGA CTGGGCGGAC CGACATTGAG ATCGAGATCA GCAACCAGTG CGCCAGGCTG ATCGCCAACG CAGTCATCTA CTACAACTCG GCAATCCTGT CGCGGTTGCT AACGAAGTGC GAAGCGGCAG GTAACGCCAA GGCCATCTCG GCGCTCACCC AAATATCGCC GGCAGCCTGG CGGCACATTT TGCTGAACGG GCATTATACC TTTCAGAGTG ACGGTAAGGT GATTGACCTG GACACGCTTG TGGCTGGGGT TGAACTGGTA TGA
Protein sequence | MAIARSERLT VLSDAEQEAL YGLPDFDDAQ RLEYLALTEN ELALASSRPS IHAQLYCILQ IGYFKAKNDF FRFHWRDVED DCAFVLSRYF QGEAFEAKQI SKHEHYSQRG KIAALFGYQP WASSYLPSLE QQASLIVRRD VTPSFVAAEL IVWLNEHKII RPGYTTLQAL VSEVLSVERR RLGALLAQVL DEPAKTALSQ LLVRDDTLSQ LAALKQDAKD FGWRQMARER EKRALLAPLH EIAKGLLPKL GISQQNLLYY ASLANFYTVH DLRNLKAEQT QLYLLCYAWV RYRQLSDNLV DAMAFHMDQL DHENRAGARD TFAEEQVKRH QETPQVGRLL SLYVDDSVAD PTPFGEVRQR AYKIMAKDVL QNTAQRMSVK PLNKLTLHWQ AVDGQAKRIR RHLRPLFVAL QFAATDPDSQ WLAALTWAKS VFAKQQRLSQ RPLAECPVAT LPKRLRPYLL TFDADGEPTG LHADRYEFWL YHQVKKRFQS GELYLDDSLQ HRHFSDELVS MEEHAQELAQ MDIPFLRQPI ETQLDVLATE LHRQWQAFNR ELKQGKLTHL EYDKDAQKLT WRKPKRENQK AREHTFYEQL PYCDVADVFR FVNNQCQFLS ALTPLQPRYA KKVADTDSLI AVIIAQAMNH GNLVMARTSD IPYHILESAY QQYLRQASLH EANDFISNAI AALPIFPYYS FDLDVLYGAV DGQKFSVERP TVKARYSRKY FGRGKGVVAY TLLCNHIPLN GYLIGAHDYE AHHVFDIWYR NTSDIVPEVI TGDMHSINKA NFAILHWFGR RFEPRYTDLN HQLQALYCVG DPARYEKCLI QPVGQIDRQL IVSEKANIDR IIATLGLKEM TQGTLIRKLC TYTAPNPTRR AIFEFDKLVR SIYTLRYLRD PQLERNVHRS QNRIESYHQL RSSIAQVGGK KELTGRTDIE IEISNQCARL IANAVIYYNS AILSRLLTKC EAAGNAKAIS ALTQISPAAW RHILLNGHYT FQSDGKVIDL DTLVAGVELV
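The protein sequence corresponds to a codon-by-codon translation of the gene sequence using translation table 11. The sketch below is illustrative only: it translates the first 30 and last 15 bases of the gene sequence rather than the full record. The codon table is built from the compact amino-acid string used in NCBI's genetic code listings. The amino-acid assignments of table 11 are the same as those of the standard code (the tables differ in permitted start codons). The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

first_30 = "ATGGCGATTG CTAGAAGCGA GCGACTAACA"  # start of the gene sequence
last_15 = "GTTGAACTGGTATGA"                    # end of the gene sequence

print(translate(first_30))  # MAIARSERLT
print(translate(last_15))   # VELV*
```

The two outputs agree with the first ten residues and the last four residues of the listed protein sequence, followed by the stop codon.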