Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CA2559_00175 |
Symbol | |
ID | 9295516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Croceibacter atlanticus HTCC2559 |
Kingdom | Bacteria |
Replicon accession | NC_014230 |
Strand | + |
Start bp | 62527 |
End bp | 65547 |
Gene Length | 3021 bp |
Protein Length | 1006 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | |
Product | TPR-domain containing protein |
Protein accession | YP_003714805 |
Protein GI | 298206626 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTAC CCAAAATATT AGTCATCCTA AGTTTCGGAT TACTGTTACA TACAGCCTCA GCACAACAAA CAGCTGTATT TACAAATCAA TATGTAGCGT ATGAACAAGC ACTGTCTTTG TACGATAACA ATCAATTTGC TGCAGCACAA CAACAATTTG AAAAAGTAAA GCTAACCACA GAAGATGGTA CTGTTGAGGC CAATTGCTCC TATTATATTG CTAATTGTGC AGTAAGACTT AACCAGCAAG GTGCAGATGA TCTTATGCAG GATTTTGTAG ATCGATATCC TACAAGTACA AAACGCAACT CAGCGTTTAG TGATGTTGCC AACTATTACT TTGAGAACGG TCAATATTCT CGTGCAAGAA AATGGTATGA CCAAGTAAAC CAAAGTAATC TTTCTAACTC AGAAAAAGAA AAATTCAATT TTAATTATGG CTATACATTA TTTCAGAACA AACGTTTTGA TGATGCTAAA AGCTATATGA ATAAAGTAAG AGACTCTAAA GAATATGGTG CGCAGGCAAA ATATTATTTA GGATTTATGG CTTATGAAGG TGACGATTAT CAGGAAGCAG ACGAGCTGTT TGAAGAAGTA AAAGGAAACG ATCGTTATGA GAAAAACCTA AGCTACTTTC AAGCTGATAT GAATTTCAAG CTTGGTAAGT TTGAAAAAGC TGTAGCCGAA GGTAAAGCAC AACTTGCAAA GTCTAATCCT AAAGAAAAGT CGCAACTTAA CAAGATTATT GGTGAGAGCT ATTTTAATTT AGGTAATTAT GCTGAAGCTG TTCCTTACCT AAAGGAGTAT AGAGGTACAA GAGGAAAATG GAACAATACA GATTACTACC AACTAGGTTA CGCCTATTAC AAACAAGGCG AGTTTGATAA TGCGATATCT GAATTTAATA AAATTGTAGA TGGTAAAAAT GCTGTTGCTC AAAACGCATA CTACCACTTA GCAGAATCTT ATTTAAAGCT TGACCAAAAG CAGCAAGCCT TAAATGCATT TAAAAACGCT TCTGAAATGG AGTTTGATAA TGCTATTAAA AAAGACGCCC TTCTTAATTA TGCAAAGCTA TCCTATGAAA TAGGAAACAG TTATGAGAGT ACGCCAAAAG TGTTAACACG TTACTTAGAA ACGTATCCAG ATACTGCAGA AAAAACTGAG CTTCAAAATT TATTGATCGA CTCATATATT ACTTCTAAAA ACTACGCTGA AGCTATGGAA CTTTTAGAAA GCAGTAGAGA TTTTGATAAT AAAGTGGCAT ATCAAAAAGT TGCTTTTTAC AGAGGAATTG AGTTGTATAA TGAAGACAAC TATACCGAAG CAAAAACATA TTTCGAAAAA TCTTTAAGCG AACCTAGAGA CGCTTCTTTT ACAGCAAGAG CTACGTATTG GAACGCAGAA ACAGATTATA ACTTAAATAA TTTTCAAGAT GCTCTAATTG GTTATAAAGA GTTTCAAGGT ATGAGTGCGG CCTCTCAAAC AGAAGCATAT AAAAATTTAG ATTATAATTT AGGGTACGCT TACTTTAAAC AAAAAGATTA CGAGCAAGCC ATCTCATATT TCAAAAAATA CTCTGAAACC TCTTCAGATA CTTCAAGAAA AAAAGATGCT TTTTTACGCT TGGGTGATAC TTACTTTGTT ACTAGTAAAT ATTGGCCAGC AATGGAAAGC TATAATGATG CTATTGCTCT TGGCGGTAAA TCTGCAGATT ATGCTGCATT TCAAAAAGCT ATTAGTTACG GTTTTGTAAA CAAGAACGAC CGCAAGATTG AGGACTTAAC TTCATTTTTA AATCAGTTTT CAAGGTCTAC ATATCGTGAT GATGCCCTTT ATGAGCTAGG TAACACTTAT GTAGCTATCG GCAATACACA AGAAGGTATA AAAGCATACA ACCGTTTAAT TAGAGATGTC CCTAAAAGCT CTTATGTTTC AAAAGCACTT TTAAAACAAG GTCTTATTTA TTACAACTCA GATCGTGGTA ATGAAGCCTT GGAGAAATTT AAAAAAGTAG CCGCAGATTT TCCAGGAACC GCACAGGCAG ACCAAGCTGT TAAAACTGCT CGATTAATAT ATGTAGATTT AGGACGAACT TCTGAATATG CCAACTGGGT TAAAACTTTA GATTTTGTAA ATGTTACAGA TGCAGATCTT GACAACACAA CTTATGAGGC TGCAGAACAA CAATACAGGC AGGAAAATGC AAATGCAGCA CTAAGAGGTT TTGAAAATTA CCTAGAAGAG TTTCCTAATG GTTTACATGC GTTACAAAGT CATTTCTATC TAGCTCAACT TCAATTTAAA GACAATAAAA AAGAAGAAAG TATCTCTCAT TATAAATTTG TATTAACCAA AGAACGTAAT GAGTTTACAG AACAATCTTT AGCTCAATTG TCTCAAATTT ATTTAGAGAA ATCAAATTAT AAAGACGCAA TACCAGTACT TAAACGATTA GAGACCGAAG CAGATTTTCC GCAGAACATT GTCTTTGCCC AATCTAACTT AATGAAAAGC TATTACCAGC AAGACAACTA TGAGCAAGCT GTTTCCTATG CAGAAAATGT ATTGGCAAAC TCAAGTATAG ATACTAAGGT TAAAAATGAT GCGCATATCA TTATAGCAAG ATCTGCAATG AAAACTGGAG ATGAAGCTAA AGCAAAAACG GCTTATGCTA CGGTGCAAAA AACGGCTACA GGAAAGTTAG CTGCAGAAGC TTTATACTAT GATGCTTATT TTAAGAATAA AGCAGGTAAT TACAAAGCCT CAAATGAAAG TGTACAAACC TTAGCTAAAG AGTACTCAGG ATATAAAGAA TATAGTGTAA AGAGTTTATT AGTTATGGCT AAGAACTTTT ATGCTTTAGA AGATGCATAC CAAGCTACTT ATATTTTAGA AAATATCATT AACAACTTTA CAGACTACCC TACAGTTGTA GATGAAGCTA AAGCAGAGCT AATTAAAATT AAAACAGAAG AAGCTAAAAC AAATGCAGAT GTTAACACTA ACGGCAACTA A
|
Protein sequence | MKLPKILVIL SFGLLLHTAS AQQTAVFTNQ YVAYEQALSL YDNNQFAAAQ QQFEKVKLTT EDGTVEANCS YYIANCAVRL NQQGADDLMQ DFVDRYPTST KRNSAFSDVA NYYFENGQYS RARKWYDQVN QSNLSNSEKE KFNFNYGYTL FQNKRFDDAK SYMNKVRDSK EYGAQAKYYL GFMAYEGDDY QEADELFEEV KGNDRYEKNL SYFQADMNFK LGKFEKAVAE GKAQLAKSNP KEKSQLNKII GESYFNLGNY AEAVPYLKEY RGTRGKWNNT DYYQLGYAYY KQGEFDNAIS EFNKIVDGKN AVAQNAYYHL AESYLKLDQK QQALNAFKNA SEMEFDNAIK KDALLNYAKL SYEIGNSYES TPKVLTRYLE TYPDTAEKTE LQNLLIDSYI TSKNYAEAME LLESSRDFDN KVAYQKVAFY RGIELYNEDN YTEAKTYFEK SLSEPRDASF TARATYWNAE TDYNLNNFQD ALIGYKEFQG MSAASQTEAY KNLDYNLGYA YFKQKDYEQA ISYFKKYSET SSDTSRKKDA FLRLGDTYFV TSKYWPAMES YNDAIALGGK SADYAAFQKA ISYGFVNKND RKIEDLTSFL NQFSRSTYRD DALYELGNTY VAIGNTQEGI KAYNRLIRDV PKSSYVSKAL LKQGLIYYNS DRGNEALEKF KKVAADFPGT AQADQAVKTA RLIYVDLGRT SEYANWVKTL DFVNVTDADL DNTTYEAAEQ QYRQENANAA LRGFENYLEE FPNGLHALQS HFYLAQLQFK DNKKEESISH YKFVLTKERN EFTEQSLAQL SQIYLEKSNY KDAIPVLKRL ETEADFPQNI VFAQSNLMKS YYQQDNYEQA VSYAENVLAN SSIDTKVKND AHIIIARSAM KTGDEAKAKT AYATVQKTAT GKLAAEALYY DAYFKNKAGN YKASNESVQT LAKEYSGYKE YSVKSLLVMA KNFYALEDAY QATYILENII NNFTDYPTVV DEAKAELIKI KTEEAKTNAD VNTNGN
|
| |