Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CA2559_02590 |
Symbol | |
ID | 9296003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Croceibacter atlanticus HTCC2559 |
Kingdom | Bacteria |
Replicon accession | NC_014230 |
Strand | - |
Start bp | 609123 |
End bp | 611993 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | DNA polymerase I |
Protein accession | YP_003715284 |
Protein GI | 298207105 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAATA CTCCAGAAAA CACTCAAAAA CGTTTATTTC TTCTTGATGC CTATGCTCTA ATTTTTAGAG GCTATTATGC GTTGATAAAA AACCCAAGAG TAAACTCAAA AGGGATGGAC ACAAGTGCCA TCATGGGTTT TATGAATTCA TTGTTTGATG TTATTAAACG TGAAAAACCA GACCACTTAG CCGTAGCTTT TGATAAAGGT GGTAGTAGTG ATCGTGTAGA GATGTATGAA GATTATAAAG CCAATCGAGA TGAAACTCCA GACGCAATTA AAATAGCAGT GCCATACATA CAAAGTATAC TAAAAGCTAT GCACGTGCCT TGTATAGAAA TTGAAGGTGT AGAGGCAGAT GATTTAATAG GAACACTCTC TAAGCAAGCC GAAAAGGAAG GCTACCAAGT TTTTATGGTT ACACCAGATA AGGATTATGC GCAATTGGTT TCAGAAAATA TTTTTATGTA CAAGCCTGCA CGTATGGGTA ATGGTATAGA AATTTGGGGA ATACCAGAAG TACAGAAAAA GTTTGAAGTA GAGCGACCAG AGCAAGTAAT AGACTACTTA GGGATGATGG GAGATGCTAG TGATAATATT CCAGGTTTGC CAGGTGTTGG AGATAAGACA GCCAAAAAAT TTATTAAGCA ATATGGAGGA TTAGAAGGTT TATTAGAAAA CACAGATAAG CTTAAAGGTA AAATGAAAGA GAAAGTTATT GCTAACGCAG AGTTAGGAAG ACTTTCTAGA AAATTGGCAA CAATCATGCT AGATTGTGAT GTTACTTTTA ATGCAGAAAA CTATGAGTTG TCTGAGCCAG ATGCAGAAGC TGTAGCAGAA ATTTTTGATG ACCTTGAGTT TAGAAGACTT AAAGATCAAT TTGTAAAAAT TTTCTCTGGT GAGGCAGAAG CAAGTGTTGG CGCAGCAGTC TCTAATACAG AAACGGCTAA GAAGATTGAA ACAGCTTCGA AAAAAGCAGC TCAAGCAGGT GCAGGACAAT ACAACTTATT TGATGCTAAT AGCACGATAG CAGATGATGT TGAAGAGGCA TCATCAAGGC AGACCATAAA ATCAAAGGAA CACCATTACC AAAGTGTAGC AACATCTGGA ATGGGCTTAA AGCTATTTCT ACAAAATTTA AACAAACAGA CATCTGTTTG TTTTGATACA GAAACTACAG GACTTAATCC TTTAACTGCC GAGTTGGTAG GTATTGCGTT TTCTTGGGAT GCTGGTAAAG GATTTTACCT GCCGTTTTCA GATAACAAAG AAGAGGCACA ACAATTAATA GAAGAATTAA GGCCATTTTT TGAAAACGAA GACATTCAGA AAGTTGGGCA AAATTTAAAG TATGATATTA AAGTACTACA TAAGTATGAT ATAGAAGTTC GCGGACCTCT TTTTGATACC ATGTTAGCAC ATTATCTTAT AAACCCAGAC ATGCGCCATA ATATGGATGT GCTTGCAGAA ACATACCTCA ATTACTCACC GGTTTCAATA GAAACATTAA TTGGTAAAAA AGGTAAGAAT CAAAAAAGTA TGCGAGATGT GCCTTTAGAA GATCAAACCG AGTACGCGGT GGAAGATGCA GATATCACAC TTCAATTAAA AGAGCATTTT GAAAAAGAAT TGGGAGATGC CAATACCCAA AAATTGTTTG ATGATATTGA AATTCCATTA CTACGTGTAT TGGCAGCAAT GGAGTTGGAA GGTATTAATT TGGACAAAGC ATTTTTAAAT AATTTAGCTG AAGACTTAAA CAATGACATT GCAACTTTAG AGGCTTCGAT TTATAAAGAA GCTGGAGAAG AATTTAATAT TGGCTCGCCA AAACAACTGG GCGAAATTCT CTTTGATAAA CTTAAACTAG TAGAAAAGCC TAAGAAAACC AGAACAGGAC AATATTCAAC TGCAGAAGAT GTACTAAGTT ATCTTGCCGC AGATCACACA ATAATTCAAA ATGTACTTGA CTATCGTGGG TTGGCAAAAC TTAAAAGTAC ATATGTAGAC GCACTACCAG AGCAAGTTGA AGAAGATGGT AGAGTGCATA CAGATTATAT GCAAACCGTT GCTGCTACAG GTCGTTTAAG TAGTAATAAC CCTAATCTTC AAAATATACC AATACGTACA GAACGAGGAA GGCAAGTAAG AAAGGCATTT GTACCTAAAA ATGAAGACTA TGTACTACTC GCTGCAGATT ACAGTCAAAT TGAATTAAGG ATTATCGCGG CATTAAGCGA GGAAGACACA ATGATTGAAG CCTTTAAAAA TGGTGAAGAT ATTCACGCCA GTACTGCAGC AAAAGTTTTT AATGTACCTA TTAATGAGGT AACAAGAGAA CAGCGTAGTA ATGCTAAAAC CGTAAACTTC GGTATTATAT ATGGTGTTTC AGCTTTTGGG CTAAGTAACC AAACAGACCT TACACGAAGT GAATCGAAAG ACTTAATCGA CACGTATTAT AAAACCTACC CAAAGCTTCG AAACTACATG AGTAATCTAG TAGATGATGC TCGTGAAGAT GGTTATGTAA GTACCGTATT AGGAAGACGC CGTTATTTAA AAGATATAAA TTCTAGTAAT GGTGTTGTAA GAGGTGCTGC AGAACGAAAT GCTGTAAATG CGCCAATACA AGGTAGTGCA GCAGATATAA TTAAAGTGGC TATGATTAAT ATTCATAAAA AATTAGCCGA AGGAAACTTT AAAACTAAAA TGTTACTTCA GGTACATGAT GAATTGGTTT TTGATGTTCC AAAAAATGAG TTAGAAGACA TTAAAACTTT AGTAAAAACA GAAATGGAAA GCGCTTACAC CTTAAGCGTA CCGTTAGATG TAGAGGTTGG TGTAGGTAAC GATTGGCTAG AAGCGCATTA A
|
Protein sequence | MTNTPENTQK RLFLLDAYAL IFRGYYALIK NPRVNSKGMD TSAIMGFMNS LFDVIKREKP DHLAVAFDKG GSSDRVEMYE DYKANRDETP DAIKIAVPYI QSILKAMHVP CIEIEGVEAD DLIGTLSKQA EKEGYQVFMV TPDKDYAQLV SENIFMYKPA RMGNGIEIWG IPEVQKKFEV ERPEQVIDYL GMMGDASDNI PGLPGVGDKT AKKFIKQYGG LEGLLENTDK LKGKMKEKVI ANAELGRLSR KLATIMLDCD VTFNAENYEL SEPDAEAVAE IFDDLEFRRL KDQFVKIFSG EAEASVGAAV SNTETAKKIE TASKKAAQAG AGQYNLFDAN STIADDVEEA SSRQTIKSKE HHYQSVATSG MGLKLFLQNL NKQTSVCFDT ETTGLNPLTA ELVGIAFSWD AGKGFYLPFS DNKEEAQQLI EELRPFFENE DIQKVGQNLK YDIKVLHKYD IEVRGPLFDT MLAHYLINPD MRHNMDVLAE TYLNYSPVSI ETLIGKKGKN QKSMRDVPLE DQTEYAVEDA DITLQLKEHF EKELGDANTQ KLFDDIEIPL LRVLAAMELE GINLDKAFLN NLAEDLNNDI ATLEASIYKE AGEEFNIGSP KQLGEILFDK LKLVEKPKKT RTGQYSTAED VLSYLAADHT IIQNVLDYRG LAKLKSTYVD ALPEQVEEDG RVHTDYMQTV AATGRLSSNN PNLQNIPIRT ERGRQVRKAF VPKNEDYVLL AADYSQIELR IIAALSEEDT MIEAFKNGED IHASTAAKVF NVPINEVTRE QRSNAKTVNF GIIYGVSAFG LSNQTDLTRS ESKDLIDTYY KTYPKLRNYM SNLVDDARED GYVSTVLGRR RYLKDINSSN GVVRGAAERN AVNAPIQGSA ADIIKVAMIN IHKKLAEGNF KTKMLLQVHD ELVFDVPKNE LEDIKTLVKT EMESAYTLSV PLDVEVGVGN DWLEAH
|
| |