Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CA2559_09243 |
Symbol | |
ID | 9297332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Croceibacter atlanticus HTCC2559 |
Kingdom | Bacteria |
Replicon accession | NC_014230 |
Strand | + |
Start bp | 2022640 |
End bp | 2025735 |
Gene Length | 3096 bp |
Protein Length | 1031 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003716594 |
Protein GI | 298208415 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.100624 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TTACTTATTG GCTTTTTTTG TTAATATTTA CAATTTCCTT AACATCTTTG GAAGCGCAAG AAAAGCGACA AACAACAACA CAAGAACAAG AAATTGAAGC AAAACTTCAA ACCCTATTTG GTTTGGAAAA AGAAGTTTTT GAAAGGCGTG TAAATATCTC TGCACAGCAA CAAGCGCCTA AAGCTCTTAA GATGTCTAGT AATTCTTTAC AGAGATCTAC AGAAATTCAA TCCTCACACG TACAATCTGG TTTTAACCTT CAAGGCGATT TAAGCTGTAG CCAAGAGCAG GTAAGTAATA ATTTTGAAAA TGGATTATTT ATTGAAGCTG GCGGTCAAAA AGTTGCAGAT GATTTTTTTG TTTCAATAAA TACAGACACG TTTGATGTTA ACCAAATTAG TGCTAACATA CTAACACAAG GTGGTTTAGA AAGTGTTAAT ATTACATTTT ATGAAGATGA TGCTGGTTTG CCTGGTACAC AAATAGGTGC TTCAATAATT AGTTTAGTCC CAACATCTCA AGATGTTATA GGGACAGCTT TTGGATTTGA TGTACACGAT GTAGTTTTAG ATTTACCAAG CACAATTTCT TTTGCAGGCA CAGGTACAGA AGCTGTAAGA TATTGGGTTC AATTAGAAGG AAACCCAAAT ACTGCAGGAA CAAGTGTTGG TATAGAGTCA ACTTCTGTAG GTGTAATAGG AGAGTTATCT GTATTTGATA ATGATGGAAA TGCAACTAAT GAATGGCTTT TAAATGATGG CGGCTTAGAT AGTGTAATTA CTATAAGCGG TAATTGTATA CAAGTTTCAG GCTGTTTAGC ACCTGAAAAC TTTGTAGTAA GCCCAACTGG AGAAAATGCA GATTTTACTT GGGATAATGT TCCAGGTGCT GTTGATGGTT ATACACTATC AGTTTTTGAA GCTGGCGCAG ATCCTACTAC TGCAACTGCT GTGTTTAGCG CTAATTATGG TGCGGGAACA ACTATGGCAA CAGCTACAGG CTTATTTACA ACTACACCTT ACGATGCGTA TATAAATTCA GATTGTGGTG GTACAATTTC TGCTCAAAAC ACATTAAGTT TTTCAACTAC TATTGAAGAT CCTGTTTGTG GTAACAGTTA TTTAGATTCA GGAGGAAGAG ATAACAATTA TCAGGACAGT GAGTTAATTA CAACTACAAT TTTTCCTGAA AATGATGGAG ATGTTGTAAC CTTAACTTTT ACGTTTGTTG ATATTGAAGT AAATACTACA GGCGCAGGAA CTCAAGATGG TTGTTGGGAT TTCCTAACCA TTTATAATGG ACCAGATACG TCGTCTCCTG TATTAGCACA AACTCTTTGT GGTGAGGCTA GTGGTTCTGG AGCAACACCA TCTGTTGATA CAAGTAATCT TGAAATAGGT GATTCATTCA CATCAACAGA TATGTCTGGA GCATTAACAA TTGTATTTAC ATCTGATGAA GTCTTTAACT TTGGTGGATT TGAAGCCGTT ATCTCTTGCG ATGTGCCTCC TGTTTGTATG GCACCAGATT TAGTACTTGA TAATGTTACT ACAGACACTG CAGAATTTAG TTGGAGTGAA GTTGCAAATG CTAATAATGG GTATATCTTC TCTGTATTTA GCGAAGGAGC AGACCCTACA ACAGCAACTC CTGTTTATAC AGAAAATATA CCTGCAGGAA CGCTTACAGC TACAGCAACA GGTTTATCAG ATACTACTAT ATATGACGCA TACATCACTG CAGATTGTGA TGCCGACGGT TTGTCTGCAT CAGACTCGGT GACATTTGAA ACCAATTTTC CAGATCCTGC TTGTGGTGGC AAATTTTATG ATACTGGAGG ACCAAATGGT GACTTTGAAA ATAACGAAGA CTACACAACT ATAATTGCTC CAGATGATGC AGGAGATGTA GTTACTGCCA CATTTACCTT TGTAAACAAT ACAGAATTTG ATGTACTTAC TGTAGATACT GGTGATGGTA GTGGCCCTCA AGTAGTACCA GAAATACCTA TGGGAGGCAC TCCAATCTCA TATACTTCAT TTGCTTCAGA TGGTAGCTTA ACATTTCAAT TTACATCTTC AGGTGTAGTT GAAAATGCAG GTTGGGAAGC AGATATCACT TGTGATTTAC CAGCGGCTTG TTTACAACCA CTTAATTTTG ATGTTTCAGC AATTACAGAT ACATCAGCTA CATTTACTTG GGATGAAGAA ACTAATGCAA CAAATGGATA TGTTTTAGAA GTTTATAGTT TTGGTGATAG TCCAGGATCT GGAACGCCTG TTTATACAGA AACTGTTGCA TCTGGAACTT TAACAGCAAC AGCAACAGGG TTAGACACTA ACTCAATGTT TACTGCTTAT ATTTATTCAG ATTGTGATAC AGATGGTATA TCAGAAACTA CAGACATAGA GTTTGAAACC TTAATAACGC CACCAGCTTG TGATGGCACT TTTAGCGATA GTGGTGGAGT AGATGGTAAT TATTCTTCTA GTGAGGTTAC TACAACTACA ATTACACCAG ACAACGCTGG AGATGCTGTT ACTATTACGT TTACTTACGT AGATATTGAA ACAGCTACAG CTGCAGGAAG TCAAGATGGT TGTTGGGATT TTATGACCAT TTATAATGGT CCAGATACCA CATTCCCAGT TTTAGCACAA ACACTTTGTG GTGAAGAGAG TGGAGATGGT GGTGCACCTT CTGTAGATAC AAGTTTACTA TCTGTTGGAG ATGCATTTAC ATCAACAGAT CCTTCAGGTG CATTAACTAT TGTTTTTACT TCAGATAGTT CAGTTGAAGA GACAGGTTGG TTAGCAGACG TAACTTGTGC TACATTATCT GTGGATGAGT TTAGTGCAAC TAACTTTACA TATTATCCAA ATCCTAGCAC TGGACATTTA ACTATTAATT CTAAAGAAAC TATTGATTCT GTTGAAGTAA TTAATCTATT AGGACAGCAA TTAATTAAAC AGAAGCCTAA TAGCCAAGAT TATACTTTAG ACTTAACTAC GTTAAGTGCT GGGCAGTATT TCTTAAGAGC ACAAATAGAT GGTAAAACTG TGGTAAAATC TATTTTAAAA GAATAA
|
Protein sequence | MKKITYWLFL LIFTISLTSL EAQEKRQTTT QEQEIEAKLQ TLFGLEKEVF ERRVNISAQQ QAPKALKMSS NSLQRSTEIQ SSHVQSGFNL QGDLSCSQEQ VSNNFENGLF IEAGGQKVAD DFFVSINTDT FDVNQISANI LTQGGLESVN ITFYEDDAGL PGTQIGASII SLVPTSQDVI GTAFGFDVHD VVLDLPSTIS FAGTGTEAVR YWVQLEGNPN TAGTSVGIES TSVGVIGELS VFDNDGNATN EWLLNDGGLD SVITISGNCI QVSGCLAPEN FVVSPTGENA DFTWDNVPGA VDGYTLSVFE AGADPTTATA VFSANYGAGT TMATATGLFT TTPYDAYINS DCGGTISAQN TLSFSTTIED PVCGNSYLDS GGRDNNYQDS ELITTTIFPE NDGDVVTLTF TFVDIEVNTT GAGTQDGCWD FLTIYNGPDT SSPVLAQTLC GEASGSGATP SVDTSNLEIG DSFTSTDMSG ALTIVFTSDE VFNFGGFEAV ISCDVPPVCM APDLVLDNVT TDTAEFSWSE VANANNGYIF SVFSEGADPT TATPVYTENI PAGTLTATAT GLSDTTIYDA YITADCDADG LSASDSVTFE TNFPDPACGG KFYDTGGPNG DFENNEDYTT IIAPDDAGDV VTATFTFVNN TEFDVLTVDT GDGSGPQVVP EIPMGGTPIS YTSFASDGSL TFQFTSSGVV ENAGWEADIT CDLPAACLQP LNFDVSAITD TSATFTWDEE TNATNGYVLE VYSFGDSPGS GTPVYTETVA SGTLTATATG LDTNSMFTAY IYSDCDTDGI SETTDIEFET LITPPACDGT FSDSGGVDGN YSSSEVTTTT ITPDNAGDAV TITFTYVDIE TATAAGSQDG CWDFMTIYNG PDTTFPVLAQ TLCGEESGDG GAPSVDTSLL SVGDAFTSTD PSGALTIVFT SDSSVEETGW LADVTCATLS VDEFSATNFT YYPNPSTGHL TINSKETIDS VEVINLLGQQ LIKQKPNSQD YTLDLTTLSA GQYFLRAQID GKTVVKSILK E
|
| |