Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CA2559_09813 |
Symbol | |
ID | 9297451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Croceibacter atlanticus HTCC2559 |
Kingdom | Bacteria |
Replicon accession | NC_014230 |
Strand | + |
Start bp | 2142207 |
End bp | 2145371 |
Gene Length | 3165 bp |
Protein Length | 1054 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003716708 |
Protein GI | 298208529 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTTAC TTTTTATCGT TTTTGCGGTA GCAATGGGTG TAGGTACTTT TGTTGAAAGT TATTACAGTA CAGAGACCGC AAGGCAATAC ATTTATAATG CACATTGGTT TGAGGTTATC ATGGTGTTAT TTGCCATAAA TTTCTTCGGA AACATTTTTA GATATAACCT TCACAAGAGA GAAAAGTGGT CTACACTATT ATTACATATG TCTTTTGTAC TTATTATAGT TGGAGCAGGA ATTACACGTT ATATAGGTTT TGAAGGTATT ATGCCTATTA GAGAAGGACA AGCCACAAAT AAATTTATGT CTGAAAAAAC CTTTTTAACT TTTATGATTG ATGGTGAGGT AGATGGCGAA GCATTGCGAA GACGTAAAAG TGAAGAGCTA TTACTAGCAC CTAAAGGAAA TAATGACATT ACAATAAACA CAGATTTTAA GGGCGATCCT ATTTCTTTTA AGGTAGTTAA TTACATTCAT GGTGCAGAAG AAGGTTTTAG AGAAACAGAA GATGGAAGTA ATTACATAAA AATTGTAGAA GCTGGTGGCG GTAACCGTCA CGATCACTTT ATTAAGGAAG GCGAAGTTGT AAGTATTCAT AATGTTCTTT TTGCTTATAA CAAACCAACA GATGGCGCAA TAAATATTAC AGTACCAGAA GATGGTAGTG ATTACCTTAT AAAAACACCT TTTGAAGGTG ATTTTATGAG AATGGCAGAT CAATTTAAAG GCGAAGTTTT AAAAGATTCT GTTCAGAAAT TAAACCTGCG TTCGCTTTAT AATGTTGCTG GTATGCAGTT TGTATTTCCA GATCCAGTTG TAAAAGGAGA AGAAGGCATT GTTGAAACTG AGCAAAAATC TAAAGGGCAG GCAGATGCTG TAACATTAGA GGTAAAAGCT AATGGAGACA CAAAGAATAT TCAATTATTA GGGAGTAAAG GACGTATGCC AGACCCTAAG AAATTAGAAG TTGGAGGCTA TGATATTTAC CTAAGCTATG GTAGTAAGGA ATATGAACTT CCATTTGCTT TAATGCTAAA AGATTTTGAG GCTAAAAAAT ATCCAGGAAC AGAAAACAAT CCCACACCAA GTTACCAATC TTTTAAAAGT AAGGTAGACA TTGTAGACGA TGGTGAAAGT GTTCCTTATG AAATCTATAT GAATCACGTA TTAGATAAAG ATGGCTATCG CTTCTTCCAA GCTTCTTTTG ATGCCGATGA AAAAGGAACA ATCTTATCTG TTAATCACGA TTTCTGGGGT ACGTGGATTA CGTATATCGG GTACTTCTTA TTATACTTAG GACTTATGCT AATTTTATTT GATAAAGGTT CTCGTTTTGG CCAACTAAAG AAAATGTTGG ATAAAGTAAA AGCTAAAAAA CAAGCGTTAA CAATTTTACT TGTGCTAAGT ACAGCTTTAG GCTTTGCACA AACTACAGAT GATGGTCATG ACCATAGTGA CCCTAATCAC GTACACGAAG ATGTACAAGA AATAGATCAA GAACGATTAG ACTCTCTAAT TGTAGCTAAT GCAGTAAGCA AAGAACACGC AGAAGAGTTT GGTAAGATGA TTATACAAGA TGCTGGCGGT CGTATGAAAC CAGCAAACAC GTATTCATCA GAGCTTTTAA GAAAGTTAAG TAAGGCAGAC TCGTACAATG GTCTTACTAG TGATCAGGTT CTTATAAGTA TGTTAGAGAA CCCTACAGTT TGGTATAATA TTCCATTAAT ACATGTGGAA CGTAAAAACG ATTCAATTCG TCATATTACA GGTGTAGATG AAGATGCTAA ACTACTACCA TTAGCAAAAT TCTTTGATGC TACAGGAACA TATAGACTGG CACCTTATTT AGAAGATGCC TATCAAGCTC AAAACCCTAC ATCATTTCAA AAAGATTTTA TAAAAACAGA TCAAAAGGTA AACTTGCTTT ACAGTGCTTT AGAAGGAGAT TTATTAAAAA TATTTCCAGT TCCTGGAGCA GAAAACAACA AATGGGTAAG CTATCCAGAA TTAAAGGAAC ATAGCTTTAC GGGACGAGAC TCGGTATTTG CCTATAATGC ACTACCAATT TACTTAACAA CATTAAGACA AGCTAAACAA ACACAAGGTT ATGCTCAAGC AACTGAGGTC TTAGAAACAA TTAAAAAGTT TCAAAGTAAC CATGGTGCAG AAATTATGCC TTCAGACCAA AAGGTTAAAA CAGAAATACT TTATAATAAG TACGATGTCT TCAGGAATTT ATTTTGGATG TTTATGCTTG CAGGTCTAAT AACTCTACTG TTTGTAATTC TTAAAATATT TTATAATAAC AAGTTTATTA AAAGCTTGGT TTTAATTGGT AAAATTGCCA TTATCCTACT ATTTATAATA CATACTGCCG GCTTAATAGC ACGTTGGTAT ATTTCTGGTC ACGCACCTTG GAGTGATGCT TATGAGAGTA TGATATATGT GGCTTGGGCT ACTATGTTCT TCGGCTTGGC TTTTGGTCGT AAAAGTGATT TAACTTTGGC GTCTACAGCA TTTGTAGCTT CAATGATATT AATGATTGCT CATTGGAATT GGATGGATCC TGCAATAGCT AACCTTGTAC CTGTATTAGA TAGTTATTGG TTAATGATTC ACGTTGCTGT AATTGTTGGT AGTTATGGAC CATTTACATT GGGTATGATA TTAGGTGTTG TTTCACTATT GTTAATGATT TTTACCAATT CGAATAATAA AGCTAAAATG AAACTTAACA TACGTGAGAT AACCATTATT ACAGAGATGG CACTTACAGT TGGTTTAGTA ATGCTTACCA TAGGTAATTT CTTAGGTGGG CAATGGGCCA ATGAAAGTTG GGGACGTTAT TGGGGTTGGG ACCCTAAAGA AACTTGGGCA CTAATTAGTA TTATGGTATA TGCATTTGTG ATACATATGC GATTAGTACC TGGTTTAAGA GGTCGCTGGT TTTTCAACTT TATGTCTATC GTTGCGTTTG CAAGTATCAT GATGACGTAT TTTGGAGTTA ACTTTTACCT TTCTGGATTG CATAGTTATG CAAGTGGAGA TAAAGTAATA ACTCCAGATT TTATCTACTA CTCAATTGCA GTGGTTGTAA TTTTAGGAGC TTTGTCTTAT TGGAAATACA AAAAGCACTA TTATAAAGGA GATTATAATA AATAG
|
Protein sequence | MALLFIVFAV AMGVGTFVES YYSTETARQY IYNAHWFEVI MVLFAINFFG NIFRYNLHKR EKWSTLLLHM SFVLIIVGAG ITRYIGFEGI MPIREGQATN KFMSEKTFLT FMIDGEVDGE ALRRRKSEEL LLAPKGNNDI TINTDFKGDP ISFKVVNYIH GAEEGFRETE DGSNYIKIVE AGGGNRHDHF IKEGEVVSIH NVLFAYNKPT DGAINITVPE DGSDYLIKTP FEGDFMRMAD QFKGEVLKDS VQKLNLRSLY NVAGMQFVFP DPVVKGEEGI VETEQKSKGQ ADAVTLEVKA NGDTKNIQLL GSKGRMPDPK KLEVGGYDIY LSYGSKEYEL PFALMLKDFE AKKYPGTENN PTPSYQSFKS KVDIVDDGES VPYEIYMNHV LDKDGYRFFQ ASFDADEKGT ILSVNHDFWG TWITYIGYFL LYLGLMLILF DKGSRFGQLK KMLDKVKAKK QALTILLVLS TALGFAQTTD DGHDHSDPNH VHEDVQEIDQ ERLDSLIVAN AVSKEHAEEF GKMIIQDAGG RMKPANTYSS ELLRKLSKAD SYNGLTSDQV LISMLENPTV WYNIPLIHVE RKNDSIRHIT GVDEDAKLLP LAKFFDATGT YRLAPYLEDA YQAQNPTSFQ KDFIKTDQKV NLLYSALEGD LLKIFPVPGA ENNKWVSYPE LKEHSFTGRD SVFAYNALPI YLTTLRQAKQ TQGYAQATEV LETIKKFQSN HGAEIMPSDQ KVKTEILYNK YDVFRNLFWM FMLAGLITLL FVILKIFYNN KFIKSLVLIG KIAIILLFII HTAGLIARWY ISGHAPWSDA YESMIYVAWA TMFFGLAFGR KSDLTLASTA FVASMILMIA HWNWMDPAIA NLVPVLDSYW LMIHVAVIVG SYGPFTLGMI LGVVSLLLMI FTNSNNKAKM KLNIREITII TEMALTVGLV MLTIGNFLGG QWANESWGRY WGWDPKETWA LISIMVYAFV IHMRLVPGLR GRWFFNFMSI VAFASIMMTY FGVNFYLSGL HSYASGDKVI TPDFIYYSIA VVVILGALSY WKYKKHYYKG DYNK
|
| |