Gene Jann_1683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_1683 
Symbol 
ID3934131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp1665993 
End bp1668782 
Gene Length2790 bp 
Protein Length929 aa 
Translation table11 
GC content62% 
IMG OID637904034 
ProductDNA polymerase I 
Protein accessionYP_509625 
Protein GI89054174 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.569036 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.718828 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTCG GCAAAGGGCA TCATCTACAC CTTGTTGATG GATCGGCATT CATCTTCCGC 
GCCTATCATG CGCTGCCACC GCTGACCCGC AAGTCCGATG GCCTGCCCAT CGGTGCGGTG
GCGGGTTTCT GCAACATGCT CCACAAGATG ATCGAGGGGA ATACCGGCCC CGATGCGCCG
ACCCACGCCG CCGTGATCTT CGACAAGGGC AGCCACACGT TCCGCAACGA CCTCTATGAT
CAATACAAAG CCAACCGTGA TGCGATGCCC GAGGATTTGC GCCCGCAGAT CCCGCTGACC
CGCGACGCGA CCCGCGCGTT CAACCTTGCC TGCATCGAGA TGGAGGGGTT TGAGGCCGAC
GATATCATTG CCACCTATGC CCGGATGGCG CGGGAGGCCG GGGGGCGGTG TACGATCATC
AGCTCGGACA AGGACATGAT GCAGTTGGTC GGCGACGGGG TGGAGATGTT CGACGCGATG
AAGAACACCC GCATCGACCG CGAGGGGGTG GAGGCGAAGT TCGGCGTTGG CCCCGAACGG
GTCGTGGATG TGCAGGCGTT GGCCGGTGAC AGCGTGGATA ACGTGCCCGG CGCGCCGGGG
ATCGGGATCA AGACGGCGGC GCTTTTGATC AACGAATACG GGGATCTGGA CGCGCTGCTG
GACCGCGCGG AAGAGATCAA GCAACCCAAG CGGCGTCAGA CCCTGATCGA CCACGCGGAC
CAGATCCGCC TGTCGCGGCA ATTGGTCCTG CTGGATGAGA ATGTGGAACT GGAGGACGGC
CTCGACGCGC TGGATGTGCG TGAGCCGGAT CACGACACAC TGCTGGCGTT TCTGGCGGAA
ATGGAGTTCC GCACCCTGAC CAAACGCATC GCGGACAGCG CGGGTGTTGA GGCGCCGGTG
ATTGAGGAGC CCGCGCCGGA AGCAGCCCCC GATGCGCCGG AGATGCCGCC GATTGACCAC
GCCAAATACG AGGTCGTTAA CGACGTTAAG ACCTTGCAGG CTTGGGTCGA CCGTGCCGTG
GTGCGCGGTG AGGTGGCGTT TGATACGGAG ACGACGTCCC TCAACGAGAT GACCGCGGAG
CTTGTTGGCG TGTCGCTTTG CATCGAGCCC GGGGCGGCGT GCTACATTCC GCTGCTCCAC
CGGGGCGGCG GCGATGACCT CTTCGCTGAC ACCTCGCTAG CCGAGGGGCA GATCCCCTTT
GACGACGCCA TGGAGATCCT GCAGCCGATG CTGGAAGATC GCAGCGTCAT GAAAGTCGCC
CAAAACGCCA AGTACGATGT GAAGGTCCTG GCGAATTATG GCGTGGAGGT TGCGCCCATC
GACGACACGA TGCTGCTGTC CTACGCGCTG CATGCGGGTC TGCACAATCA CGGCATGGAT
GGGTTGGCGG AGCGCTATCT GGGCCACACG CCGCTACCAA TCAAGTCCCT GATCGGAAGC
GGAAAATCAC AGATCACGTT TGACCGGGTG CCGATTGCCG ATGCCGCGCC CTACGCCGCA
GAAGACGCCG ATATCACGCT GCGCTTTCAC AAGCTGTTCA AGCCGAAACT GCATCAGGTC
GGCGTCACCA AGGTTTATGA ACGGCTGGAG CGGCCTCTGG TGCCGGTCCT CGCGCGGATG
GAGCGGTCCG GCATCAAGGT CGACAAGGAC GTGCTCAGCC GTATGTCCAA CGCATTCGCG
CAGAAGATGG CGGGGCTGGA GGCGGAAATC CATGAGCTGG CGGGCGAAAG TTTCAACGTC
GGCTCGCCTG CGCAATTGGG CGAAATTCTG TTTGATAAGA TGGGCTTGCA AGGGGGGAAG
AAGGGCAAGA CGGGCAAATA TTCCACCGGC GCGGATATTC TGGAAGATCT GGCGACTGAG
CATGACTTGC CGGGCCGCGT GCTGGACTGG CGGCAGCTGT CGAAACTGAA ATCCACCTAC
ACGGACGCGT TGCAGGACCA CATCAACGCC GACACGGGCC GCGTGCACAC GTCCTATTCC
ATTGCGGGCG CGAATACGGG GCGGTTGGCC TCCACCGATC CCAACCTGCA GAACATCCCG
ATCCGGTCGG AGGAAGGCCG CCGCATCCGG GAGGCGTTTG TAGCCGAGCC CGGTAAAGTT
CTGGTGGCGC TTGATTATTC CCAGATCGAG CTGCGCATCC TCGCCCATAT CGCGGGCATC
GACGCGCTGA AAGACGCGTT CAAGGACGGG CAGGACATCC ACGCCGCCAC CGCGTCCGAG
ATGTTCAACG TGCCGCTGGA AGAGATGACG CCGGACGTCC GCCGTCAGGC CAAGGCGATC
AACTTCGGGG TGATCTACGG CATCTCGGGC TTCGGGCTGG CGCGCAACCT GCGCATTCCG
CGGGCCGAGG CGCAGGGCTT CATCGACCGC TATTTTGAAC GGTTTCCCGG CATCCGCACC
TACATGGACG ACACCAAGAA ATTCGCCAAA GAAAATCTTT ACGTCCAAAC CCTGTTCGGG
CGCAAAATCC ACACACCCGA GATCAATGCG AAAGGCCCCG GCGCAGGCTT TGCCGGGCGC
GCCGCGATCA ACGCACCGAT CCAGGGGACA GCCGCCGACA TCATCCGCCG CGCGATGATC
CGGATGGAGG ATGCGATTGA GGGCATTCCG GCCAAGATGT TGCTTCAGGT TCACGATGAA
CTGGTGTTCG AGGTGGATGA AGACGCCACG GACACGCTGA TCGCCCGCGC CCGCGAGGTC
ATGGAAGGCG CGGCGGACCC GGCGGTTCAT CTGTCGGTGC CCATCACCGT CGATGCAGGG
CAGGGGGAAA CCTGGGCGGA GGCCCATTGA
 
Protein sequence
MAFGKGHHLH LVDGSAFIFR AYHALPPLTR KSDGLPIGAV AGFCNMLHKM IEGNTGPDAP 
THAAVIFDKG SHTFRNDLYD QYKANRDAMP EDLRPQIPLT RDATRAFNLA CIEMEGFEAD
DIIATYARMA REAGGRCTII SSDKDMMQLV GDGVEMFDAM KNTRIDREGV EAKFGVGPER
VVDVQALAGD SVDNVPGAPG IGIKTAALLI NEYGDLDALL DRAEEIKQPK RRQTLIDHAD
QIRLSRQLVL LDENVELEDG LDALDVREPD HDTLLAFLAE MEFRTLTKRI ADSAGVEAPV
IEEPAPEAAP DAPEMPPIDH AKYEVVNDVK TLQAWVDRAV VRGEVAFDTE TTSLNEMTAE
LVGVSLCIEP GAACYIPLLH RGGGDDLFAD TSLAEGQIPF DDAMEILQPM LEDRSVMKVA
QNAKYDVKVL ANYGVEVAPI DDTMLLSYAL HAGLHNHGMD GLAERYLGHT PLPIKSLIGS
GKSQITFDRV PIADAAPYAA EDADITLRFH KLFKPKLHQV GVTKVYERLE RPLVPVLARM
ERSGIKVDKD VLSRMSNAFA QKMAGLEAEI HELAGESFNV GSPAQLGEIL FDKMGLQGGK
KGKTGKYSTG ADILEDLATE HDLPGRVLDW RQLSKLKSTY TDALQDHINA DTGRVHTSYS
IAGANTGRLA STDPNLQNIP IRSEEGRRIR EAFVAEPGKV LVALDYSQIE LRILAHIAGI
DALKDAFKDG QDIHAATASE MFNVPLEEMT PDVRRQAKAI NFGVIYGISG FGLARNLRIP
RAEAQGFIDR YFERFPGIRT YMDDTKKFAK ENLYVQTLFG RKIHTPEINA KGPGAGFAGR
AAINAPIQGT AADIIRRAMI RMEDAIEGIP AKMLLQVHDE LVFEVDEDAT DTLIARAREV
MEGAADPAVH LSVPITVDAG QGETWAEAH