Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Jann_1683 |
Symbol | |
ID | 3934131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | + |
Start bp | 1665993 |
End bp | 1668782 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637904034 |
Product | DNA polymerase I |
Protein accession | YP_509625 |
Protein GI | 89054174 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.569036 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.718828 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATTCG GCAAAGGGCA TCATCTACAC CTTGTTGATG GATCGGCATT CATCTTCCGC GCCTATCATG CGCTGCCACC GCTGACCCGC AAGTCCGATG GCCTGCCCAT CGGTGCGGTG GCGGGTTTCT GCAACATGCT CCACAAGATG ATCGAGGGGA ATACCGGCCC CGATGCGCCG ACCCACGCCG CCGTGATCTT CGACAAGGGC AGCCACACGT TCCGCAACGA CCTCTATGAT CAATACAAAG CCAACCGTGA TGCGATGCCC GAGGATTTGC GCCCGCAGAT CCCGCTGACC CGCGACGCGA CCCGCGCGTT CAACCTTGCC TGCATCGAGA TGGAGGGGTT TGAGGCCGAC GATATCATTG CCACCTATGC CCGGATGGCG CGGGAGGCCG GGGGGCGGTG TACGATCATC AGCTCGGACA AGGACATGAT GCAGTTGGTC GGCGACGGGG TGGAGATGTT CGACGCGATG AAGAACACCC GCATCGACCG CGAGGGGGTG GAGGCGAAGT TCGGCGTTGG CCCCGAACGG GTCGTGGATG TGCAGGCGTT GGCCGGTGAC AGCGTGGATA ACGTGCCCGG CGCGCCGGGG ATCGGGATCA AGACGGCGGC GCTTTTGATC AACGAATACG GGGATCTGGA CGCGCTGCTG GACCGCGCGG AAGAGATCAA GCAACCCAAG CGGCGTCAGA CCCTGATCGA CCACGCGGAC CAGATCCGCC TGTCGCGGCA ATTGGTCCTG CTGGATGAGA ATGTGGAACT GGAGGACGGC CTCGACGCGC TGGATGTGCG TGAGCCGGAT CACGACACAC TGCTGGCGTT TCTGGCGGAA ATGGAGTTCC GCACCCTGAC CAAACGCATC GCGGACAGCG CGGGTGTTGA GGCGCCGGTG ATTGAGGAGC CCGCGCCGGA AGCAGCCCCC GATGCGCCGG AGATGCCGCC GATTGACCAC GCCAAATACG AGGTCGTTAA CGACGTTAAG ACCTTGCAGG CTTGGGTCGA CCGTGCCGTG GTGCGCGGTG AGGTGGCGTT TGATACGGAG ACGACGTCCC TCAACGAGAT GACCGCGGAG CTTGTTGGCG TGTCGCTTTG CATCGAGCCC GGGGCGGCGT GCTACATTCC GCTGCTCCAC CGGGGCGGCG GCGATGACCT CTTCGCTGAC ACCTCGCTAG CCGAGGGGCA GATCCCCTTT GACGACGCCA TGGAGATCCT GCAGCCGATG CTGGAAGATC GCAGCGTCAT GAAAGTCGCC CAAAACGCCA AGTACGATGT GAAGGTCCTG GCGAATTATG GCGTGGAGGT TGCGCCCATC GACGACACGA TGCTGCTGTC CTACGCGCTG CATGCGGGTC TGCACAATCA CGGCATGGAT GGGTTGGCGG AGCGCTATCT GGGCCACACG CCGCTACCAA TCAAGTCCCT GATCGGAAGC GGAAAATCAC AGATCACGTT TGACCGGGTG CCGATTGCCG ATGCCGCGCC CTACGCCGCA GAAGACGCCG ATATCACGCT GCGCTTTCAC AAGCTGTTCA AGCCGAAACT GCATCAGGTC GGCGTCACCA AGGTTTATGA ACGGCTGGAG CGGCCTCTGG TGCCGGTCCT CGCGCGGATG GAGCGGTCCG GCATCAAGGT CGACAAGGAC GTGCTCAGCC GTATGTCCAA CGCATTCGCG CAGAAGATGG CGGGGCTGGA GGCGGAAATC CATGAGCTGG CGGGCGAAAG TTTCAACGTC GGCTCGCCTG CGCAATTGGG CGAAATTCTG TTTGATAAGA TGGGCTTGCA AGGGGGGAAG AAGGGCAAGA CGGGCAAATA TTCCACCGGC GCGGATATTC TGGAAGATCT GGCGACTGAG CATGACTTGC CGGGCCGCGT GCTGGACTGG CGGCAGCTGT CGAAACTGAA ATCCACCTAC ACGGACGCGT TGCAGGACCA CATCAACGCC GACACGGGCC GCGTGCACAC GTCCTATTCC ATTGCGGGCG CGAATACGGG GCGGTTGGCC TCCACCGATC CCAACCTGCA GAACATCCCG ATCCGGTCGG AGGAAGGCCG CCGCATCCGG GAGGCGTTTG TAGCCGAGCC CGGTAAAGTT CTGGTGGCGC TTGATTATTC CCAGATCGAG CTGCGCATCC TCGCCCATAT CGCGGGCATC GACGCGCTGA AAGACGCGTT CAAGGACGGG CAGGACATCC ACGCCGCCAC CGCGTCCGAG ATGTTCAACG TGCCGCTGGA AGAGATGACG CCGGACGTCC GCCGTCAGGC CAAGGCGATC AACTTCGGGG TGATCTACGG CATCTCGGGC TTCGGGCTGG CGCGCAACCT GCGCATTCCG CGGGCCGAGG CGCAGGGCTT CATCGACCGC TATTTTGAAC GGTTTCCCGG CATCCGCACC TACATGGACG ACACCAAGAA ATTCGCCAAA GAAAATCTTT ACGTCCAAAC CCTGTTCGGG CGCAAAATCC ACACACCCGA GATCAATGCG AAAGGCCCCG GCGCAGGCTT TGCCGGGCGC GCCGCGATCA ACGCACCGAT CCAGGGGACA GCCGCCGACA TCATCCGCCG CGCGATGATC CGGATGGAGG ATGCGATTGA GGGCATTCCG GCCAAGATGT TGCTTCAGGT TCACGATGAA CTGGTGTTCG AGGTGGATGA AGACGCCACG GACACGCTGA TCGCCCGCGC CCGCGAGGTC ATGGAAGGCG CGGCGGACCC GGCGGTTCAT CTGTCGGTGC CCATCACCGT CGATGCAGGG CAGGGGGAAA CCTGGGCGGA GGCCCATTGA
|
Protein sequence | MAFGKGHHLH LVDGSAFIFR AYHALPPLTR KSDGLPIGAV AGFCNMLHKM IEGNTGPDAP THAAVIFDKG SHTFRNDLYD QYKANRDAMP EDLRPQIPLT RDATRAFNLA CIEMEGFEAD DIIATYARMA REAGGRCTII SSDKDMMQLV GDGVEMFDAM KNTRIDREGV EAKFGVGPER VVDVQALAGD SVDNVPGAPG IGIKTAALLI NEYGDLDALL DRAEEIKQPK RRQTLIDHAD QIRLSRQLVL LDENVELEDG LDALDVREPD HDTLLAFLAE MEFRTLTKRI ADSAGVEAPV IEEPAPEAAP DAPEMPPIDH AKYEVVNDVK TLQAWVDRAV VRGEVAFDTE TTSLNEMTAE LVGVSLCIEP GAACYIPLLH RGGGDDLFAD TSLAEGQIPF DDAMEILQPM LEDRSVMKVA QNAKYDVKVL ANYGVEVAPI DDTMLLSYAL HAGLHNHGMD GLAERYLGHT PLPIKSLIGS GKSQITFDRV PIADAAPYAA EDADITLRFH KLFKPKLHQV GVTKVYERLE RPLVPVLARM ERSGIKVDKD VLSRMSNAFA QKMAGLEAEI HELAGESFNV GSPAQLGEIL FDKMGLQGGK KGKTGKYSTG ADILEDLATE HDLPGRVLDW RQLSKLKSTY TDALQDHINA DTGRVHTSYS IAGANTGRLA STDPNLQNIP IRSEEGRRIR EAFVAEPGKV LVALDYSQIE LRILAHIAGI DALKDAFKDG QDIHAATASE MFNVPLEEMT PDVRRQAKAI NFGVIYGISG FGLARNLRIP RAEAQGFIDR YFERFPGIRT YMDDTKKFAK ENLYVQTLFG RKIHTPEINA KGPGAGFAGR AAINAPIQGT AADIIRRAMI RMEDAIEGIP AKMLLQVHDE LVFEVDEDAT DTLIARAREV MEGAADPAVH LSVPITVDAG QGETWAEAH
|
| |