Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_2131 |
Symbol | |
ID | 7085937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 2531739 |
End bp | 2534858 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643461034 |
Product | phage integrase family protein |
Protein accession | YP_002358058 |
Protein GI | 217973307 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.075091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.99757 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACCA CTAAAGTCAG TGCTTTCATA ATCCAGCTAT CAAAAGAGGG GAAGCTACAT CCATTATACA TAGAATGCCG CACCGCATTT TTAAACAGCA TATCGCCATC TATTGCTGTA AAAAATAAAA AGTCAGAAAA AATTGAAAAC GAATTTTTTG CATCCTATCA CAATCTGCTT GGGCATCTAA AAAAGGAGAT TAAACTTTCT GATAACGCTT TTGTACCAGA AGAGTTAGAA AACGTGCTTG CCGCATTGAA GCCTGAAGTG CTAAGCAAAC AAGCACAACT TCTCATACTG CAATCTATAC TCAACTATGC CAAAAAGTTC TTAGACATAG ACTCGCCAAA CATTCCTTCA ATCATCACGT TAAAACGTGA TAAACCGATG TTGACCCCGG TAGATTTGAC AAAATTACCC ATAGTCGAGC TCATTAACAC TCAATTAAGC AGGGAGTTGA TAGCGCCTCA TAGAGATTTA GATGACAAGG CTAAGTTGGG GAGACTATTA CTCGTCTTAT ATTACACAGT GAACATAGAG AAAACTGAAC ATTTACAGCT TATGGTTGAA TACCCACAGG ACATATTCTA CGTGGGAGGA ATTTGTTATT GGCAATGCCA AAATATCAAA ATAAATTCGC CACGGTATGT GCTAAGTGAT ATGGCTGTCA TGGCCTTACA GCAATGGCAT AATGTGCACC ATAGCGCGTC TTTCGCAATA AAGCCCAGTC AGTTCAAAGC TGCTTTAGTG CATTATCTTA ATTACGGATC GTATTTCGAT TGGTCAGATA TATCACTATT GAAGCTAAGG GTGCTGCGAC GAATTGATAA CGTCCTCCGT TATGGCCCGG TTCAATATCG TATGTACATG TTACCTGGCG TTTGCCAGCC ATTGCCCACA CATGCGTTTG TACGTATCCT GACAGGAAAA GTTTATGCAA ACGCCGAACC GTACCTTTCA GAAGCCATTT CTGATAAAAC CAAGGCTTTT CGTGATTGGT CCGTTGTCAC GAACGAGAAA ACTTCTTTTG TGTCGATGGA AAAGACCCTC GAACAATTAG ATAAGGTATA TACCGAGCTC TGTAAACTTG TAGATAGCAA CAAACCTCGA GCCGATTGTC TTGGGGCTAT AGCAATCATT CTGGATCAAC AATCATCTAA GAGTAATCCA TACTTTTGGA TTTTATGTGC ATGGCTGTAT TCGTTACTAA AGACAGGTGG TACGCACAAG CGGCGGCTAA AAGTCTCGAC AACTATTGAT TATGTCAAAA GTTTAAGCCG GCCATTTCTA ATCATTTTTT GCACAAGCGA TATTTACTTA TTGTCTGGTG AGGATTGGGC TGAGAAATTA AATGATGCTG CTGACCATTT TTCTTCGGCA CAGCGAAAAA AATATATCTA CTATTTTGCG CAGTTTCTGA GTGATTCAGG CTTTGTTCGA GATTTATGTT TATCTGATAT TGAGGTGTTT GGGAGCTCCA GTGAGGTAGA TAGCAACCTG ATTTCCCATG AGCATGTGCA AGTCATGTTG TCCTATTTAG ATCATCAATC GGCACATTGT CCGGCGCACC ACGATGCCTA TTTTCTTTTG TGTTTCTGTT TTTACAGTGG ATTACGTCGA AATGAAGCCG CCAAGTTAAC CTGGGGGGAT TTTACATTCT CTCTGACTGA ACCAACCCAT AACAACTTTG ATTATGTTCA GCTTTCAGTG AGACCTAATA AACATAGGAC ATTAAAAACG ACCTCCGCCA GAAGAGAGTT ACCTTTAGAC GCTTTGTGGC CGAAAACGGC ATTGAGTAAG TTGCGATACA CTTATCGAAT CGTCAATGGA GCAGGGGCTA AGAATAGCAC TTTACTGTTT GATAATCAGA GAAGAGTGAA TCAGGCTTAT GACTTAATAA CAGACTTAAT GCGTCATTAC ACCCAAGATC AAAGTTTACG TGTACATCAT TTACGACACA GTTTTGCTAA TTGGACCTGG TGTCGTTTGA ATGCCAGCAT CATAGATACG GGCAGGCGAC ATTTGACCTT GTTCAATGAT GAGTTTTTTG ATGAAAAGTA TCTAAAAAGG TTGCAAACTC GTCTTTGCTA TAGTGATAAC ACCCGCAAGA AAATGTTTAT CCTGTCTCAT ATGATGGGCC ATAAAGACGT GCAATCGACG CTAAATAGTT ATCTTCATTT AAAAGACCTA CTGTATTACC TGCAGCACAA TTCGCGCTTT GAACTGACCA AATACTTTAC ATCAGAGTGC GTGGGTCGTG TGACACTGCA ACCCCTTGAA CCAGGATTGA GTCTAGCTGA ACGCATAACG TACTACACTC AGGACGTAAT GCACAAACTT GCCATTAAAC CAGCTCAACA AACAGTCAGC CTAGTGATGC CAAGTTTAGC GAACTTCTCT GTCAATATAA AAACAGACTT AACTATCAGT AGCTTAACTT GGGCCAAAGC ATTAAATGCG TTACATACAT CATCGACCAC CGAAGTTGCC GCGCACTATG TGGTTCCATT AGATCAATTA CAAAAGTTAC TCAGTAATGC CGAGGCTATT CATAAGAATT ATCCTCGCAG AGGTAGGCAC TTACCCTTAA TCCCTAAGTT TCCGCGTATA GACGTTTCAG TTGTTAACAT CCAACAAAAT AAACATTTAG CTAATTCAAC ACGTGTGTTT ATCTTTTTAT GTAATAAATT GGATAACAAC ATTAGTGAAG GTTCATTGAC CTTAGAAAAT ATTCGCCTAG GGATAGAAAT TTTGCGGTAT GCGGTTCCCG GTAAAAATTA CGCCTTACGC TGTCCAGACT CAAATATTTC AAGAATGTTT ATCCGCCTAT GTCAATTGCT GGATTTAAAA GCCAGGCATT TACAGTTTCG CTATCATAAT GCAAACCTAG CCCCTGAAAA GTCTAATCTT ATCCAGGCTA GATGGAGGAA GACACTTATC AAACATGGTT TTAGTGACAC TAACTTTGTC GTTGCTAGTG AGAGTGAAGG GCTTTATTTA GGCAAGCATG ATGGTAATGG GTTTCTAGAG ATAGCCGTAG TTAACAATTC ATATAAGCGA ATACAAAGAT ATCAAAGCAT ATTCAGTTTT TTACATTTGC TGCTTATTTT GAGCTATTGA
|
Protein sequence | MATTKVSAFI IQLSKEGKLH PLYIECRTAF LNSISPSIAV KNKKSEKIEN EFFASYHNLL GHLKKEIKLS DNAFVPEELE NVLAALKPEV LSKQAQLLIL QSILNYAKKF LDIDSPNIPS IITLKRDKPM LTPVDLTKLP IVELINTQLS RELIAPHRDL DDKAKLGRLL LVLYYTVNIE KTEHLQLMVE YPQDIFYVGG ICYWQCQNIK INSPRYVLSD MAVMALQQWH NVHHSASFAI KPSQFKAALV HYLNYGSYFD WSDISLLKLR VLRRIDNVLR YGPVQYRMYM LPGVCQPLPT HAFVRILTGK VYANAEPYLS EAISDKTKAF RDWSVVTNEK TSFVSMEKTL EQLDKVYTEL CKLVDSNKPR ADCLGAIAII LDQQSSKSNP YFWILCAWLY SLLKTGGTHK RRLKVSTTID YVKSLSRPFL IIFCTSDIYL LSGEDWAEKL NDAADHFSSA QRKKYIYYFA QFLSDSGFVR DLCLSDIEVF GSSSEVDSNL ISHEHVQVML SYLDHQSAHC PAHHDAYFLL CFCFYSGLRR NEAAKLTWGD FTFSLTEPTH NNFDYVQLSV RPNKHRTLKT TSARRELPLD ALWPKTALSK LRYTYRIVNG AGAKNSTLLF DNQRRVNQAY DLITDLMRHY TQDQSLRVHH LRHSFANWTW CRLNASIIDT GRRHLTLFND EFFDEKYLKR LQTRLCYSDN TRKKMFILSH MMGHKDVQST LNSYLHLKDL LYYLQHNSRF ELTKYFTSEC VGRVTLQPLE PGLSLAERIT YYTQDVMHKL AIKPAQQTVS LVMPSLANFS VNIKTDLTIS SLTWAKALNA LHTSSTTEVA AHYVVPLDQL QKLLSNAEAI HKNYPRRGRH LPLIPKFPRI DVSVVNIQQN KHLANSTRVF IFLCNKLDNN ISEGSLTLEN IRLGIEILRY AVPGKNYALR CPDSNISRMF IRLCQLLDLK ARHLQFRYHN ANLAPEKSNL IQARWRKTLI KHGFSDTNFV VASESEGLYL GKHDGNGFLE IAVVNNSYKR IQRYQSIFSF LHLLLILSY
|
| |