Gene Information |
Locus tag | CA2559_04165 |
Symbol | |
ID | 9296322 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Croceibacter atlanticus HTCC2559 |
Kingdom | Bacteria |
Replicon accession | NC_014230 |
Strand | + |
Start bp | 950515 |
End bp | 953730 |
Gene Length | 3216 bp |
Protein Length | 1071 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | |
Product | outer membrane protein |
Protein accession | YP_003715597 |
Protein GI | 298207418 |
COG category | |
COG ID | |
TIGRFAM ID | |
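The coordinate and length fields above agree with one another, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 950515
end_bp = 953730

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which does not contribute an amino acid to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 3216
print(protein_length_aa)  # 1071
```

Both results match the Gene Length (3216 bp) and Protein Length (1071 aa) fields of this record.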
Plasmid Coverage Information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAACAAC TTTTCATTGG CTTTGCGTTA TTGCTTTCTG CAATGAGCTT TGCCCAAACT ACTATTACCG GAACTGTTAC AACCACAGCT TCTGGAGAAA CAGTGCCATT TGTAAATGTT ATTTTAAATA ATAGCACGAC AGCTACAACT ACAGATGATA ATGGCCGCTA CAGTATCGAC ATAAATTCAG AATTAGATGT CCTTAAATTT TCTGCTTTAG GCTTTATTTC TCAATCTATA ACAGTTGGTA ACAAGACCAT AATAAATGTA GCATTAGAAG AATCTACAAC AGACCTAAAT GAAATTGTAA TTACGGCTTT AGGATTTAAA CGAGAAACCA AAGAGTTAGG ATATGCCGTA CAAAGTTTAG GTAGTGATGA TATTCAAGAA GTTAAAGCTG TAAATTTTCT AGATAATTTA AGCGGAAAAT TAGCAGGTGT TACTATAAGT CAAGGAGCAA CAGGTGTTGG TTCTACTTCA AAAATAACAA TACGAGGTGA GGCTTCATTT TCTAATAACA ATCCCTTATT TGTTGTAGAC GGTACACCTA TTAATAATAA TACGGTTTTT AATTTCACAA ATGAAGCAGC AGCAGGTTTT CAAGAAGTAG ATTTTGGTAA TGGTGCTATG GAGGTTAATC CAGATGATAT CGCTTCGGTT TCAGTATTAA AAGGACCAAG TGCAGCAGCA CTTTATGGTA CTAGAGCATC TAACGGCGTT ATTGTTATAG AAACTAAGAA TGGAGCAAAC AAAAAAGGCT TAGGAGTAAG TTATAATACC AGCCTTTTTA TAGATACTGC ATTTAGGTTG CCAGATTTTC AAAATGAGTA CGGCCAAGGA AATTCTGGAG AGTTTGAATA TGTAGATGGC TTAGGTGGCG GTATAAACGA TAATATAACA TATTCTTGGG GGCCAAGGTT AGACCAAGGC TTACTCATAC CACAGTTTGA CAGTCCTGTT GTACTTGCTA ATGGCACAAT TGTAAGAGGT GGCGATACTT CTGTTTATGA TGGCCAACCA ATAACTCCTA CAGCATTTAA TTCAAATCCT GATAACCTTA AGGATTTTTA TGAGACAGGT GTAACAACTA TTAACAACCT ATCAATAGCT ACTGGTTTTA ATACTGGAGA TTTTAGGTTA TCTCTTACAG ACTTAAGAAG TGATGGTATT ATTCCTGGCG TTAACTTAGA CAGACAAACC ATATCTACAA AGCTAAATTT TACACCAACT CAAAAAACAA AAATCACGTC TAACATAAGT TATGTAAACT CTCAAAGTGA TAATAGACCA TCAAATGGAT ATGGTAGTGA AAATGTAAAT TACTCTTTAG TAGCTTGGGG ACCAAGATCT TTAAATATAG ATAGCTTAAG AGATTACTGG CAACCTGGTT TAGAAGGAGT ACAACAATAC TCTTTTAACT ACACCTTTTT TGACAATCCA TATTTTATCC TTTTTGAAAA TAGAAATAGT TTTAATAGAG ATCGTGTTTT TGGAAATGTT TCTATCAAAC ATAATTTTAC AGAAAAGCTA AGTGTTGCAG TAAGGTCTGG AATGGATTAC AGTAATGAAA AACGTCAATT CTTAAGAAAT TTTAGTTCAA ATAGATTTAA AAATGGTGCG TACGCAGAGC ACGATGTTTT TTTTAGAGAA ATAAATACAG ATATCTTAGT AAACTATCAA GATATTGTTG GAGACCTTTC TTTTGATGTG TCTTTAGGAG GAAACAGATT AGATCAAAAT GCATCTACCA AACAATCTCA AGCAACTAAT TTAGCACAAC CTGGTATATT TAGTCTTAAC 
AATGCAGCTT CTCCTATTGA AGTATTTCAG TTTGAGTCTC AAAAGCGAAT CAATTCTATT TATGGCTTGG CAAAATTTGG GTATAAAGAC TACCTATTTT TAGACATTAC AGGAAGAAAC GATTGGTCTA GCGCATTAGC AACTCCTTTT TCGGTAGATG GTACATCTTT CTTTTACCCT TCAGCTTCAG CAAGTTTTAT ATTGTCTGAG GTAGCAACGC TTCCAAGCAT ATTTTCTTAC GCTCAGCTAA GAGCAAGCAT AGCACAAGTA GGTAATGATA CTAACCCATA CCAAACCTCA GGAACGTTTG TCTCTCAAAC CCCATTTAAT AGTCAACCTA CATTTAGCAA TCAGGATTTA ATTCCAAATG CAAACCTAAA GCCAGAAAGC ACCACATCTT ATGAGGCAGG TTTTGATGTT CGTTTTTGGA GAGATCGTCT TAATTTAGAT TTTACGTATT ATAATGCATT AACTAAAGAT CAAATAATAT CATTACCAAT TGGTATATCC TCAGGATATA ACCAACAGGT AGTTAACGGA GGAAAAGTTC GAACAGAAGG TGTTGAAATT ATTGCAGGCT TAATACCAAT CATAACAGAT AAATTTAGGT GGACAACCAC ATTTAATTTT AGTAAAAGTG TTGCAACTGT TGAAGACTTA CCACAAGATG ATGGTCGCCT AACTTTAGGC TTAAGTAGAA TTTATGACAG TGCTAACCAA ACGGTTTTCT TTCAAGTTGA AGAAGGTGGT CGCGTAGGTG ATTTTTATGG AACAGGCTAT CTTAAAAACG AAAACGGCGA TTTTATCCTA ACCGATGACG GAAGATATAT TGCAGATAAT AATTTACAGA AATTTGGAAA CTATAATCCA GATTTTATGC TAGGATGGAA CAACCAATTC TCCTACGGTA ACTGGAATTT GAGCTTCCTT TTTGATTGGA GACAAGGTGG CGAAATTGTA TCTAGAACTA GAGCTTTAGG TAATGTAGGT GGACAATTGG CAGAAACTGC TTTTAGACCT GAAGGCGGCA TTATAGCTCA AGGTGTTGTA AATACGGGTA CTGCAGAAAA TCCTAACTAT ATTCCTAATA CAACTGCAGT AACTGCAGAA AGCTACTACC GTCAATTTTA TGATAGAAAC CACGAAGAAA ATAATATATA CGATGCCTCT TATTTAAAGC TAAGACAGTT TTCTGTGGGT TATACCTTTA AATTAAATGA TGGATTTATA GGACTTAAGG AAGGTGTAGA TGTAAACCTT TCTTTAGTTG GAAGAAACTT ATTTGCTATT ACAGAAAACC CGCATTTTGA TCCAGAGCAA TTAGCCGTAC AAGGACAAAG TTTTGTAAGT GGTGTAGAAG ATATGAGTTA TGCCACAACA AGAAGTATAG GTTTTAAAGC CGGATTTAAT TTCTAA
Protein sequence | MKQLFIGFAL LLSAMSFAQT TITGTVTTTA SGETVPFVNV ILNNSTTATT TDDNGRYSID INSELDVLKF SALGFISQSI TVGNKTIINV ALEESTTDLN EIVITALGFK RETKELGYAV QSLGSDDIQE VKAVNFLDNL SGKLAGVTIS QGATGVGSTS KITIRGEASF SNNNPLFVVD GTPINNNTVF NFTNEAAAGF QEVDFGNGAM EVNPDDIASV SVLKGPSAAA LYGTRASNGV IVIETKNGAN KKGLGVSYNT SLFIDTAFRL PDFQNEYGQG NSGEFEYVDG LGGGINDNIT YSWGPRLDQG LLIPQFDSPV VLANGTIVRG GDTSVYDGQP ITPTAFNSNP DNLKDFYETG VTTINNLSIA TGFNTGDFRL SLTDLRSDGI IPGVNLDRQT ISTKLNFTPT QKTKITSNIS YVNSQSDNRP SNGYGSENVN YSLVAWGPRS LNIDSLRDYW QPGLEGVQQY SFNYTFFDNP YFILFENRNS FNRDRVFGNV SIKHNFTEKL SVAVRSGMDY SNEKRQFLRN FSSNRFKNGA YAEHDVFFRE INTDILVNYQ DIVGDLSFDV SLGGNRLDQN ASTKQSQATN LAQPGIFSLN NAASPIEVFQ FESQKRINSI YGLAKFGYKD YLFLDITGRN DWSSALATPF SVDGTSFFYP SASASFILSE VATLPSIFSY AQLRASIAQV GNDTNPYQTS GTFVSQTPFN SQPTFSNQDL IPNANLKPES TTSYEAGFDV RFWRDRLNLD FTYYNALTKD QIISLPIGIS SGYNQQVVNG GKVRTEGVEI IAGLIPIITD KFRWTTTFNF SKSVATVEDL PQDDGRLTLG LSRIYDSANQ TVFFQVEEGG RVGDFYGTGY LKNENGDFIL TDDGRYIADN NLQKFGNYNP DFMLGWNNQF SYGNWNLSFL FDWRQGGEIV SRTRALGNVG GQLAETAFRP EGGIIAQGVV NTGTAENPNY IPNTTAVTAE SYYRQFYDRN HEENNIYDAS YLKLRQFSVG YTFKLNDGFI GLKEGVDVNL SLVGRNLFAI TENPHFDPEQ LAVQGQSFVS GVEDMSYATT RSIGFKAGFN F