Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CA2559_03800 |
Symbol | |
ID | 9296247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Croceibacter atlanticus HTCC2559 |
Kingdom | Bacteria |
Replicon accession | NC_014230 |
Strand | + |
Start bp | 861738 |
End bp | 866798 |
Gene Length | 5061 bp |
Protein Length | 1686 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003715524 |
Protein GI | 298207345 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACTAAAG AAAAAGAGCA TAACGATCCA AAACAAAAGC GACCACTTTG GAACCGCATA TTGAGAATTT TTTTAAAATT CTTAGCTGTA CTCATTTTAC TATTTATTAT TCTTGTTTTA GTAGTTAGAA GTGAATGGGG ACAAAACCTA ATTGTAAGCA AGGCTGTAAA TTATGTATTT AATAAAACAA ATACTAAAGT AGATATAGAA AAACTCTTTA TCACTTTTGA TGGCGATATA CAGTTAGATG GCTTGTACTT AGAAGATAAA AAAGGAGACA CACTCGTATA CTCTAAATCT TTAGAAGCCG GCATACCACT TTGGGAAGCT ATTAATGGAA ATATTGGGAT AGATAATGTA GACTGGAAAG GCCTACGTGC CAACATTATT AGACAAGACT CCATAAATGG ATTTAATTAT GAGTTTCTAG TTAATGCCTT TACTCCAACA GATACCACTA CAACTGCAAC AGATAAAGAC GCTCAGTTTC CAGAGATTAG TATTGGGACC ATAAACTTTA AGGATTTTAA GGTAAAGTTT ATTGATGAGG TAAGTGGGAT AGACAGCAAG CTTAACTTAG GCGAATTTAA TCTAGATATG GATGAATTTG ATCTAAATCA AATGCGTTTT GAGATAGATA ATGTCTCACT AAAAAACACA GCAGCTTCAT TTACACAAAC TAAAAGTGTC CCAGAAACAG AAGATCTAAA TGAGTCTCCT AAACCCTACA TATCTATAGG AAAGTTATCC CTAGAAAAAG TTAAAGGAAC TTTTGAATCT GTTCCTAACA AACTTTTTGC AGATGCTAAT ATTGGTGAAT TAATTTTAAA ATTAGATGAG GCAGACTTGC TCAAAAACAC TGTCGATGTA TCTCAGCTTT CATTAAACTC TACTTCACTT TTATTTAAAA TAGAGGATAC TCAAGAAGAT GAAGATGAGG TCTCTAACCC TACAGTATCA CAACCTTTTG AGTGGCCAAT CTGGACTGTA AATGTAGACG GTATTAAAAT GAGTGATAAT AACATATCCT ATTTTAAGAA CAATGCAGTG TCTAAACGTG GTGTATTTAA TCCTGATGCC ATTATTATTA ATGACTTTAA TTTTGAAGCA GAAGACCTTC AATTAAAAGA AAAATCTGCA TTAGCAAGTA TTAATAATCT TCAATTTACA GAAGTATCTG GCATACAGTT AAAAGAAACT AATTTAAATC TAACGGTTAC AGATACAGAA GCTACTGTTT CAGATTTAAA CCTTGAGGTA AATAATAACT TCTTAAGAGG TCGCTTAAAG TTAGACTACA ATAACATAGC TTCAGCAATA GAAACTCCTG AAAATGCAAC TATAAATTTA GATATACCAA ACCTTATAGT TAATGTAAAT GATGCTTTTC TGTTTCAACC AGAATTAAAA CAAAACGAAT ACTTGCGTGC TTTAAGTAAA AAAAATATTT CTGGATCTCT AAGGATGTAT GGGTTATTGA GTGATGTTAA TATTCCTAAT GCACGTATAA ATTGGGGGAA TTCAACATCT ATTTCAGTTA AGGGACGCAT TCAAAATGCT ACAGACGTAG AAAACCTTCA GCTTAATATC CCGACGTATA TAGTAAACTC AACCAAAACA GATTTAAACC AATTTGTAGT TGAAGATTCT TTGGGTGTAA GTATTCCTAA GGACATTCAG ATTAAAGGAA GTTTGCGCGG TATGTTGAAT GATATCACAA CAGATTCTCG AATTGTAACA ACTGACGGTA ATGTTACTAT TAATGGAAAC TTTAAGAATG CAAACGCTAT TGCATTTAAT GCTTTTGTTG AAACAGATAG TCTTAAGCTT GGTAAGATTT TAAAGAATGA CCAACTAAAC ACACTTAACT TAACCCTAAA CGCAAATGGA AGTGGTAGTA CTATTAATAC GTTAAATGCT AATGTTAGCG CAAATATTAG AGACTTTAAA TACGCCAATT ACACCTATGA TACAATTGCT TTGCAAGGCG AATTAAAAGA TGGGCAAGGT CCTATTAATC TAGACTACAA GGATGAGAAT TTAAACTTGA AGTTAAAATC TATAGTGACT TTAGATTCTG TTTCACCACA AATAGATGCA ACTTTAGACC TTGTTGGAGC AGATTTGCAG GCACTCGGTC TAACTAAACG CAATATTAAA ACGGCGTTTC AATTTAATGT TAACTTTGAA GGAAATGCAG ATACGTTTGA TGCTCGAGCA ACAATTCACG ATGCTGTAAG TGTATACGAC AACCAAACCT ATCTCATAGG TGAGCTAACA GCAAATACAT TTGTAAGACC AGATACAACC TATCTTAAGG TAGATAACAG GATGTTGAAT TTAGAATTAG AATCTAATGC AAACCCAACA CAAACAACTA CTGCAATTAC AGAACATATA AACACCTATC TATCTAAACA AGAACTACCA GACAGCTTAA AGTTTGTAAA GCTTAAGCTG AAAGGTAAAT TAACCGATGC ACCTGTATTA AAAGATGTGT TTTTAGCAGA TTTACGACAA CTAGATACTG TTAATATAGA TGTAAACTTT AGTGCAGAAA AACGAAACCT TGTCGCAAAA GTTGAAGCTC CATTTATAAA TTATGCAGAC AATGAACTAG ATAGTCTGGC ATTTAACCTT AACTCTTCGA GAGACAGCTT AACCTTCAAT TTTGGATTAA AATCTTTAAA AGCTGGACCT ATTGCTATAA AACGCACTGA GATTTTAGGT GAGGTTTCTA ACGATTCCTT AAAGCTAGAT TTCTTAGCCT ATAATGATAC AGAAAAGCTC ATTCACTTAA AACCAGAATT TTTTAGACAA GGTGATAGCT TAAGTGTACA TATAAACCCT TCAGACCTAA CATTAAATGC CAAACCTTGG AGTATTCCTG CTAATAATAG GCTTAGGTAT ATAGATAATG TATTAGATAT CGATGACTTT ATACTTAGTA ATGAAGGGCA ATCTGTTACA ATAAGCAACA AGAAACCTAA CGTAACCAAA GACCATTTAG GCATAGATTT TAAGGGCTTC AAACTTTCCA GCATTTTAAG TTACTTAAAC CCAGATAAGG TATTGGCAAA AGGAGAACTT AATGGAGAGG TTGTTTTAGA AGAACCTTTT GGACAATCTG GTCTCCTTGC AGATTTATCT ATACAGCGAT TTCGTGTTAT GGATGTAGAT TTAAATACTT TAACCTTAAA AGCTAATTCT TCTGGTGGTG ATAAGTACAA TCTAGATATG CTTGTAAACG GTGGCGCTGT AAACTTGCAG TTAAAAGGAG ATTATCTAGC AAAAACCACA GGAGCAGAAC TAGATTTAAA CTTAGATATT AAAGAGTTTA AAATGAAAGC TTTAGAAGGT TTTAGTCAAG AGCAAATTAA AAATGCCAGC GGTAGTTTTA AAGGTAATAT AGCCGTTTCT GGAACAACGA CAGATCCTCA ATATGAAGGA GACCTTAATT TTAATGATGC TAAATTTAAC GTCGCTCAAC TTAATGCAGG CTTTCAATTA GGTAATGAAA CATTGCGTTT AGATAATGCG GGTCTGTATT TTGATAATTT CAAGATAGCA GATGAAAAAA AGAATACTTT TACTGTAGAT GGCGATGTAT TTACAGAGTC GTTAATAAAC CCTGCGTTCA ACCTGAGTTT TAAGGCAAAA GATTTTAAAG TACTCAATTC AACAAAAGAA GATAATGATC TTTATTATGG TACTGCAATA TTCGATTTAG ATGCTACATT AAATGGTGAT TTAGAGCTAC CAAAACTAAA TGCTACGCTA AATGTAGGAT CAGAAACCAA TGTAACATAC GTATTGCCAC CAAGTCAAGT TGCGGTGGAG AGTCGAGATG GCGTAGTTAT TTTTGTAAAC AAAGACAATC CAGACTCAAT TCTTACACAA ACCCAAGAAG AAGAATCTGC CATAATATCT GGTTTCGATA TAAAGACCAA CATAAAAGTT GGAAAAGAAG CTGTGGTAAA TGTAATTATA GATGAGCAAA CAGGAGACAA CTTGCGTATA CAAGGTGATG CCGATTTAAA ATTCAATATT TATCCAAATG GTCGTACTAC ACTAACAGGT CGCTATACAG TTAATGATGG GCACTATGAA CTAAGTTTAT ACAACTTAGT GAAACGCCGT TTTGAACTGA GAAAAGGCAG TACTATAACT TGGGCAGGAG ATCCTTTAGA TGCAACTCTA GATGCAAGTG CAATATATAG AGTTAAAACA TCTGCTTCTG CATTAATGGC AAGCACAACA TCTGGAGCAG ACATTTCTAC AAAACAACGT TTCCGTCAAG AATTGCCCTT TTTAGTGTAT TTAAATGTTG ATGGCCAATT AGATGAACCA AAGTTAACCT TCGATATTAA TTTACCAGAA GATGAGCAAG GTGCTATAGG AGGTCAAGTA TATGGACGTT TGCAACAATT AAATCAGCAA GAAAATGAGT TAAATAAGCA AGTGTTTTCA CTGTTAGTTC TTAACCGTTT TTTTCCAGAT ACAGGCAGTG ATGGTAGTGG TGGTGGTACA GCAAGTATTG CGAGAGATAA CATAAATCAA GCTTTAAGTG ACCAGTTAAA CGTTTATGCC GATAAGCTTT TAGGAAATAC AGGTGTAGAG CTGGACTTTG GATTAGACAG TTATACAGAT TATCAAGGAA ACAGTCCAAC AGAACGCACC ACGCTTGATG TGGCTGCACA AAAGAAATTT TTAGACGACC GCTTAGTAGT TCGCGTAGGT AGTGAAGTAG ATGTACAAGG AAGCAGTAAT ACACAAGGTG ACGGCAGCAC AACACCATTA GTTGGTAATG TAAGTATAGA GTATTTAATA ACCGAAAATG GTAAATATAG ACTTAAAGGA TTTCGAAGAA ATCAGTTTGA GAATGTTATA GACGGCCAAC TCATTGTTAG CGGTCTTGCC ATTATTTTCA CTCAAGAGTT TAATAAGTTC GATGAGTTAT TTAAAAACTT CTTGAGCAGT AGAACAGGTG CTGATGCAAA AAAAGAAGAA GAAACCAAAG TTGAAGAAGA GAAAGAAAAA GTGAATAAAG ACAATGAGTA A
|
Protein sequence | MTKEKEHNDP KQKRPLWNRI LRIFLKFLAV LILLFIILVL VVRSEWGQNL IVSKAVNYVF NKTNTKVDIE KLFITFDGDI QLDGLYLEDK KGDTLVYSKS LEAGIPLWEA INGNIGIDNV DWKGLRANII RQDSINGFNY EFLVNAFTPT DTTTTATDKD AQFPEISIGT INFKDFKVKF IDEVSGIDSK LNLGEFNLDM DEFDLNQMRF EIDNVSLKNT AASFTQTKSV PETEDLNESP KPYISIGKLS LEKVKGTFES VPNKLFADAN IGELILKLDE ADLLKNTVDV SQLSLNSTSL LFKIEDTQED EDEVSNPTVS QPFEWPIWTV NVDGIKMSDN NISYFKNNAV SKRGVFNPDA IIINDFNFEA EDLQLKEKSA LASINNLQFT EVSGIQLKET NLNLTVTDTE ATVSDLNLEV NNNFLRGRLK LDYNNIASAI ETPENATINL DIPNLIVNVN DAFLFQPELK QNEYLRALSK KNISGSLRMY GLLSDVNIPN ARINWGNSTS ISVKGRIQNA TDVENLQLNI PTYIVNSTKT DLNQFVVEDS LGVSIPKDIQ IKGSLRGMLN DITTDSRIVT TDGNVTINGN FKNANAIAFN AFVETDSLKL GKILKNDQLN TLNLTLNANG SGSTINTLNA NVSANIRDFK YANYTYDTIA LQGELKDGQG PINLDYKDEN LNLKLKSIVT LDSVSPQIDA TLDLVGADLQ ALGLTKRNIK TAFQFNVNFE GNADTFDARA TIHDAVSVYD NQTYLIGELT ANTFVRPDTT YLKVDNRMLN LELESNANPT QTTTAITEHI NTYLSKQELP DSLKFVKLKL KGKLTDAPVL KDVFLADLRQ LDTVNIDVNF SAEKRNLVAK VEAPFINYAD NELDSLAFNL NSSRDSLTFN FGLKSLKAGP IAIKRTEILG EVSNDSLKLD FLAYNDTEKL IHLKPEFFRQ GDSLSVHINP SDLTLNAKPW SIPANNRLRY IDNVLDIDDF ILSNEGQSVT ISNKKPNVTK DHLGIDFKGF KLSSILSYLN PDKVLAKGEL NGEVVLEEPF GQSGLLADLS IQRFRVMDVD LNTLTLKANS SGGDKYNLDM LVNGGAVNLQ LKGDYLAKTT GAELDLNLDI KEFKMKALEG FSQEQIKNAS GSFKGNIAVS GTTTDPQYEG DLNFNDAKFN VAQLNAGFQL GNETLRLDNA GLYFDNFKIA DEKKNTFTVD GDVFTESLIN PAFNLSFKAK DFKVLNSTKE DNDLYYGTAI FDLDATLNGD LELPKLNATL NVGSETNVTY VLPPSQVAVE SRDGVVIFVN KDNPDSILTQ TQEEESAIIS GFDIKTNIKV GKEAVVNVII DEQTGDNLRI QGDADLKFNI YPNGRTTLTG RYTVNDGHYE LSLYNLVKRR FELRKGSTIT WAGDPLDATL DASAIYRVKT SASALMASTT SGADISTKQR FRQELPFLVY LNVDGQLDEP KLTFDINLPE DEQGAIGGQV YGRLQQLNQQ ENELNKQVFS LLVLNRFFPD TGSDGSGGGT ASIARDNINQ ALSDQLNVYA DKLLGNTGVE LDFGLDSYTD YQGNSPTERT TLDVAAQKKF LDDRLVVRVG SEVDVQGSSN TQGDGSTTPL VGNVSIEYLI TENGKYRLKG FRRNQFENVI DGQLIVSGLA IIFTQEFNKF DELFKNFLSS RTGADAKKEE ETKVEEEKEK VNKDNE
|
| |