Gene Cagg_3590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3590 
Symbol 
ID7269734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4365004 
End bp4367814 
Gene Length2811 bp 
Protein Length936 aa 
Translation table11 
GC content57% 
IMG OID643568398 
ProductDNA polymerase I 
Protein accessionYP_002464864 
Protein GI219850431 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000120816 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGCGTC CGCTCTTGAT TTTGGTTGAC GGTCACGCAC TGGCATATCG CGCATTCTTT 
GCTTTGCGCG AGAGCGGCTT GCGCTCGTCG CGAGGTGAAC CGACGTATGC CGTCTTTGGG
TTTGCTCAAA TCTTGTTAAC GGCACTTGCC GAGTACCGGC CCGATTATGT TGCGGTGGCG
TTTGATGTTG GGCGAACGTT TCGCGATGAC CTATACGCCG AATACAAAGC CGGTCGCGCC
GAGACGCCGG AAGAGTTTTA TCCGCAATTC GAGCGCATTA AACAGTTGGT GCAGGCGTTG
TCAATCCCTA TCTATACCGC CGAGGGGTTT GAGGCCGATG ATGTCATCGG TAGCTTAGCT
CGCCAGGCTA CCGAGCAGGG TGTTGATACG ATTATTCTTA CCGGCGATAC CGATACGCTA
CAACTGGTGA ATGAGCACGT TCGGGTGGCG CTCGCCAATC CTTATGGCGG CAAGACGAGT
ACCACTCTGT ACGATGTCGA ACAGGTGCGT AAACGTTACG ATGGCCTCGA GCCGGCGCAG
TTGGCCGATC TGCGCGGTCT GAAGGGCGAT TCATCCGACA ATATCCCCGG TGTGCGCGGG
ATTGGTGAGA AGGGAGCGAT TACCCTCCTC AAGCAGTTCG GTTCTCTCGA TAAGCTCCTG
GATAACATTG AGGCAGCGCC CAAGCGCTAT CAACATCTCT TGCGTGAACA GGCCGACCAG
GCGCGCTCGT CGCGGCATTT GGCGACGATC GTCACCGATG CGCCGGTGCA ACTCGATCTG
GCAAAGTGTC GGCTTGGCGT GTACGATCGT GCGGCAGTAA TGGCTTTGCT CCAAGAGCTG
GAGTTTGGGG TTAGCTCGAA CCTGATCAAA AAGCTGCCGT CGGTCGTGCA AGCGGCGACG
GTTGCAACTT TACCGGCCGA TCTACCGACT GCACCACAAG GCTCGGTTCA GTTAGCGTTG
TTTGCCAACG AGTCGGCATC GCCGACGATG GTTTCCTCGG TTACGTCGGC ACAGATTGTC
CGCGATCCGC AAGCGCTGGC CGAGTTGGTA CAACGGTTGC GGGCTGCGCC GGGATTTGCA
TTCGATACCG AATGTACGAG TCTGCAAGCC GTCGGTAGTC ATCTGGTAGG GATTGCGCTG
GCTATCGCAC CTAACGATGC CTACTACGTA CCGGTTGGTC ACGAGGAGGG TGAGCAGTTG
CCGTTGGCCG ACGTGGTGGC TGCACTTGGC CCGCTGTTTG CCGACCCCAA CATACCCAAA
TTTGCCCACA ATGCGAAGTT CGATGCCGAG GTATTGGCCG GTGTCGGCAT ACAGGTGGCC
GGTCTGGCGT TTGATACCAT GATCGCGGCG GCAATGTTAG GTAAACGACA AGGGTTGAAG
GATCTGGCAT TCTACGAATT GAAACTGCCC GAACCACCGA CCACGATTGA AGATCTGATC
GGGCGAGGTA GCAAGCAGAT CAGCTTTGCT GCGGTACCGA TCGAGCAAGC CGCTCCCTAC
GCCGCCGCCG ATGCGCTGCA TACCTTACTA CTGACCGAAA CCTTGCGCGG GCAACTCACA
ACTGACACCG CCCTCCGTGA TCTCTACTAT CGGGTCGAGC TACCGCTGAT CGACGTGCTA
ACCGATATGG AGTTGACCGG TATCTTGCTC GATCACGAGT ATCTGCGCGA ACTGGGTAAA
CGGTTTGCCC AACGTATCGC CGAGCTGACC GAACAGATTT ATGCGAAGGC CGGTGGGCCG
TTTAATATCA ATTCCGGCCA ACAACTCAAC GAGGTCTTGT TTGAGCGACT GGGTATCAAT
CCGCGTGATT ATGGGCTGAG TAAGCTCAAG AGTGGTGGTT ATTCGATTAC TGCCGAGGTG
TTAGAGGAGC TAAGCCAACT CTACCCGATT GCCGCCGATA TTCTGGCTTA CCGTCAGCTT
ACCAAGCTGA AGAGTACGTA TATTGACGCT CTGCCCCAAC TGGTGAATCC ACGTACCGGA
CGCATCCATA CCTCGTACAA CCAGATCGGC GCTGCAACGG GTCGGCTGTC GTCGAATAAT
CCTAACCTGC AAAATATTCC GGTGCGCACC GAAGAGGGAC GGGAAATCCG GCGCGCGTTC
GTCGCTGCTC CGGGCCACCG TTTCGTCGCC GCCGACTACT CGCAGATTGA GTTGCGTGTG
TTGGCCCACA TCAGTGGCGA CGAAAACCTG ATCGCCGCTT TTCAGCAAGG TCTTGATATT
CACGCCGCTA CGGCCAGCCG ACTGTTTGGC GTAGCCCCTG ATCAGGTTGA CAAAAACCAG
CGTCGTGTCG CCAAGACGGT GGTGTTTGGC GTTATTTACG GAATTAGCGC TTTTGGTCTT
GCCCAACGGC TAGGTATCGA ACGCGATCTG GCGCGTCAAT TGATCGACAA CTTGTTCGAG
CAGTTCCCCG GCATCCGCCG CTATATCGAT CAAACGCTCG CATTTGGCCG GCAACACGGG
TATGTGCAAA CGTTGTTTGG CCGGCGGCGA GTGATGGAAG ATTTGCGGGC GAGTGGAGCA
CGACGGGCGG CTGCCGAGCG CGAGGCGATA AACGCACCGA TACAGGGCAC TGCCGCCGAC
ATCATGAAAA TGGCGATGGT CTATGTCCAT CGCGCTTTAC GCGAACGCGG TCTCCGCACT
CGTTTGCTCT TGCAGGTGCA TGATGAGCTG ATCGCCGAAG CGCCGGAGGA AGAGGTTCCA
GCGGCAGCTC ATCTGTTGCG TGAGGTGATG AGTAATACCT ACCAATTGGT TGTGCCGCTC
GGCGTCAATC TCGAAACCGG GCCTAATTGG GAAGAGATGG CGGCGGTGTG A
 
Protein sequence
MARPLLILVD GHALAYRAFF ALRESGLRSS RGEPTYAVFG FAQILLTALA EYRPDYVAVA 
FDVGRTFRDD LYAEYKAGRA ETPEEFYPQF ERIKQLVQAL SIPIYTAEGF EADDVIGSLA
RQATEQGVDT IILTGDTDTL QLVNEHVRVA LANPYGGKTS TTLYDVEQVR KRYDGLEPAQ
LADLRGLKGD SSDNIPGVRG IGEKGAITLL KQFGSLDKLL DNIEAAPKRY QHLLREQADQ
ARSSRHLATI VTDAPVQLDL AKCRLGVYDR AAVMALLQEL EFGVSSNLIK KLPSVVQAAT
VATLPADLPT APQGSVQLAL FANESASPTM VSSVTSAQIV RDPQALAELV QRLRAAPGFA
FDTECTSLQA VGSHLVGIAL AIAPNDAYYV PVGHEEGEQL PLADVVAALG PLFADPNIPK
FAHNAKFDAE VLAGVGIQVA GLAFDTMIAA AMLGKRQGLK DLAFYELKLP EPPTTIEDLI
GRGSKQISFA AVPIEQAAPY AAADALHTLL LTETLRGQLT TDTALRDLYY RVELPLIDVL
TDMELTGILL DHEYLRELGK RFAQRIAELT EQIYAKAGGP FNINSGQQLN EVLFERLGIN
PRDYGLSKLK SGGYSITAEV LEELSQLYPI AADILAYRQL TKLKSTYIDA LPQLVNPRTG
RIHTSYNQIG AATGRLSSNN PNLQNIPVRT EEGREIRRAF VAAPGHRFVA ADYSQIELRV
LAHISGDENL IAAFQQGLDI HAATASRLFG VAPDQVDKNQ RRVAKTVVFG VIYGISAFGL
AQRLGIERDL ARQLIDNLFE QFPGIRRYID QTLAFGRQHG YVQTLFGRRR VMEDLRASGA
RRAAAEREAI NAPIQGTAAD IMKMAMVYVH RALRERGLRT RLLLQVHDEL IAEAPEEEVP
AAAHLLREVM SNTYQLVVPL GVNLETGPNW EEMAAV