Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CA2559_12688 |
Symbol | |
ID | 9298028 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Croceibacter atlanticus HTCC2559 |
Kingdom | Bacteria |
Replicon accession | NC_014230 |
Strand | + |
Start bp | 2752580 |
End bp | 2755807 |
Gene Length | 3228 bp |
Protein Length | 1075 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003717279 |
Protein GI | 298209100 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATACC TTATCACCAG TTTTATACTA TTACTAACAA TTCAAATTAA CGCCCAAAAA CTAGACTTAT CACTAGTTAA AAACTTAGAG CCTCGAGAGA TTGGCCCGAG TGGTATGTCT GGTCGTATTA CAGCTATAGA TGTTGTTACT AGCAATCCAG ATATTATGTA TGCAGGTTCT GCATCTGGAG GTCTTTGGAA ATCAACTTCT GGAGGCGTTA AATGGGAACC AATTTTCGAA GACCAGCCAA CAGCATCAAT TGGTGCAGTT GCAATACAAC AGTCTAATCC TAGTGTTATA TGGGTTGGTA CTGGAGAAGG TAATCCACGT AATAGTTTAA ATGGCGGTTA TGGCATCTAT AAATCTTTAG ATGGCGGAAA GTCTTGGCAG TCTATGGGAT TAGAAAACAC ACGCCACATT CATCGTGTGG TTATAGACCC TACAAATCCT AATATAGTTT ATGCTGGTGC TATAGGTTCA CCTTGGGGAG AACATCCTGA ACGTGGTGTA TTTAAAACTA CAGATGGCGG TAAGACTTGG AAAAAAGTTC TTTTTGTAAA TAATAAAACT GGAGCTGCAG ATTTAATTAT GGATCCTAGC AATCCAAATA AACTTATGGC TGCAATGTGG GAACACAAAC GCGATCCTTG GTTTTTTAAA TCTGGCGGTG AAGGCTCTGG ACTTTATATA ACATATGATG GTGGAGACAA CTGGAAAAAG GTAACCGAAG AAGATGGCTT TCCAAAAGGC GACTTAGGTC GTATTGGTGT AGCTATTGCA CCTAATAAAC CAAATATTGT ATATGCTTTA GTTGAAGCTA AAAAAAATGC CCTTTATAAA TCTGAAGATG GTGGTTTTAA ATGGAGAAAA ATTAATGATA AGAGTGACAT AGGAAATCGT CCATTTTACT ATTCAGAAAT TTATGTAGAT CCTCAAAATG AAAATCGTGT TTATTCTGTA TTTACTTATG TAAATGTATC TGAAGATGGC GGAAGAAATT TTGAGCAACT CATGCCTGCT TATGGTGTGA GTAATGGTGT GCATCCAGAT CATCACGCTT GGTGGATTCA TCCAGAAGAT GGGAATTTTA TGATAGATGG TAACGATGGT GGCTTAAATA TTACCAAAGA TGGCGGTAAA ACTTGGCGAT TTGTAGATAA TATTCCTGTC GGACAATTTT ATCACGTAAA TGTAGATAAT GAGTTTCCTT ACAACGTATA TGGAGGTATG CAAGACAATG GCAGTTGGAG AGGTCCTGCA TACACGTGGC GTTCTCAAGG TATACGTAAC TCATACTGGC AAGAAATTTC ATTTGGAGAT GGTTTTGATG TTATTCCAGA CCCAGACAAT AGCCAATATG GCTGGACAAT GAGCCAACAA GGTTATGTAA GCCGTTACGA CTGGAAAACA GGAAATAATT ATGGTGTAAG ACCAACACAT CCAGATCCAG ATGTACAACT TCGTTTTAAT TGGAATTCTG CAATAAACAT AGATCCTTTT GATAGTAATG TGGTTTATTT TGGTAGCCAA TTTGTACACA AATCTACAGA CAAAGGTGAA ACGTGGAAAG TTATATCACC AGACCTAACT ACTAACAATC CAGAAAAACA AAAACAAAGC GAAAGTGGTG GTTTAACATT AGATGCCACT GGTGCAGAAA ACCATACTAC AATTTTAGTA ATAGAGCCAT CGCCTATCGA AAAAGATATG CTTTGGGCTG GTACAGATGA TGGCAGGATT CATTATACCA AAGATGGAGG TAATAACTGG ACAGAAGTTT CTAATCTTCC TGGCGTTCCA GAAAACTCTT GGATAGTTCA AATTAAAGCA AGTAATAAAA ATAAAGGAGA AGCTTTATTG GTTGTTAATG ACTACAGAAG GTTTAATTAT GAACCATACG CGTTTAGAAC AACAAACTAT GGTAAAACTT GGAAGCGTAT TGTAGATGCA GAAGATGTAC AAAGCTACAC TCTTAGTATA GTTGAAGATC CTATTGAAAG AAACCTATTG TTTTTAGGAA CAGATGACGG TCTTTATGTA AGCATAGATG CTGGAAACAA CTGGGCAAAA TATACAAATG GTTTCCCTAC AGTTCCTGTA AAAGATATGG TTATTCATCC TAGAGAACAC GATTTGGTAT TAGGAACCTT TGGACGTTCT TTCTGGATTT TAGATGATAT TAGACCATTA AGAGCAATGG CAAAAAACAA CAATCTTACT AAAGAGCGTG TGGTATTATT TGAGCCACCT ACTGCTTACC AATCTGCTTA CCAGCAGGCT ACAGGAAGCC GCTTTGGTGC AGATGCTATG TTTAGCGGAG ACAACAGAGC ATTTGGAGCA CGTTTATCTT ATTACATTAC TAAAAAAGAA GAAGATAAAG CGTCTAAAAA AGATGACAAA GACAAGGACT CTGAAGGTGA TGAAGAACAA GAAGAAACTT CAGATATAGA AGAACAACCA GAAATAGTTT GGGATTCTAT CATTTTAAAT GTTTATAACG GAGACAAGCT TATAAGAACA TTAAAGCAAA AAGCTCCAGA CTCAACTGGT TTACATAAAA TGCGCTGGTT TATGGATGAA AAAGGTGTAG ATAGACCTTC TCGTTCTATA AGAGAGTCCA AAAGAGAACC TAGTGGCACC ACTGTGAAAC CTGGGAATTA TACTTTAGAG ATGGTTTATG GTGATTCTAA ATCAACTCAA ACTATATCAG TAAAGTCAGA CCCTAGATTA AAAGTTTCAG AAGCTAATAT AAATGAGGTG TACAATGCTT CAAAAAAATT AGAAGGTTAT ACAGAAACTG CAGCAAATGC TGTAAGACAA TTGGTTGAAA GTAAGAATAC AGCAAAAGAC TTTCAGAAAC GACTTAAAGA TTTAGATAAG GACAAGTACA AAGAAACACT TAAGGAAATT TCTGATATTA ATAAAAAGAT AGACAGTCTT GTCGCGCTCT ATTTAGGTAA GGTTGATAAA AGACAAGGAA TTACCAGAAA TCCTGAAGTT ACAGTTATGC AACGCATAGG CTTAGCCAAT CAATATGTTA GTGGTAGCCA ACAAGGATTA ACGGCTACAG AAGAGCAATT AATTTCTCAA GCAAAAGCAC AGTTAAATGA AGCTTTATTA GAGACTAATA AGTTTTTCTC TGAAGAATGG AGTGATTTTA AATCTAAATT AGAAACTTTA GAACTAAATC CTTTTAAAAC CACAACTACC TTTAAAACAG TAAATTAA
|
Protein sequence | MKYLITSFIL LLTIQINAQK LDLSLVKNLE PREIGPSGMS GRITAIDVVT SNPDIMYAGS ASGGLWKSTS GGVKWEPIFE DQPTASIGAV AIQQSNPSVI WVGTGEGNPR NSLNGGYGIY KSLDGGKSWQ SMGLENTRHI HRVVIDPTNP NIVYAGAIGS PWGEHPERGV FKTTDGGKTW KKVLFVNNKT GAADLIMDPS NPNKLMAAMW EHKRDPWFFK SGGEGSGLYI TYDGGDNWKK VTEEDGFPKG DLGRIGVAIA PNKPNIVYAL VEAKKNALYK SEDGGFKWRK INDKSDIGNR PFYYSEIYVD PQNENRVYSV FTYVNVSEDG GRNFEQLMPA YGVSNGVHPD HHAWWIHPED GNFMIDGNDG GLNITKDGGK TWRFVDNIPV GQFYHVNVDN EFPYNVYGGM QDNGSWRGPA YTWRSQGIRN SYWQEISFGD GFDVIPDPDN SQYGWTMSQQ GYVSRYDWKT GNNYGVRPTH PDPDVQLRFN WNSAINIDPF DSNVVYFGSQ FVHKSTDKGE TWKVISPDLT TNNPEKQKQS ESGGLTLDAT GAENHTTILV IEPSPIEKDM LWAGTDDGRI HYTKDGGNNW TEVSNLPGVP ENSWIVQIKA SNKNKGEALL VVNDYRRFNY EPYAFRTTNY GKTWKRIVDA EDVQSYTLSI VEDPIERNLL FLGTDDGLYV SIDAGNNWAK YTNGFPTVPV KDMVIHPREH DLVLGTFGRS FWILDDIRPL RAMAKNNNLT KERVVLFEPP TAYQSAYQQA TGSRFGADAM FSGDNRAFGA RLSYYITKKE EDKASKKDDK DKDSEGDEEQ EETSDIEEQP EIVWDSIILN VYNGDKLIRT LKQKAPDSTG LHKMRWFMDE KGVDRPSRSI RESKREPSGT TVKPGNYTLE MVYGDSKSTQ TISVKSDPRL KVSEANINEV YNASKKLEGY TETAANAVRQ LVESKNTAKD FQKRLKDLDK DKYKETLKEI SDINKKIDSL VALYLGKVDK RQGITRNPEV TVMQRIGLAN QYVSGSQQGL TATEEQLISQ AKAQLNEALL ETNKFFSEEW SDFKSKLETL ELNPFKTTTT FKTVN
|
| |