Gene Cagg_3757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3757 
Symbol 
ID7267830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4579651 
End bp4581738 
Gene Length2088 bp 
Protein Length695 aa 
Translation table11 
GC content54% 
IMG OID643568564 
ProductUvrD/REP helicase 
Protein accessionYP_002465029 
Protein GI219850596 
COG category[L] Replication, recombination and repair 
COG ID[COG0210] Superfamily I DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000385426 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAAT ACCATCTGAT CACTCCACCA GCTTTCTTGA GCGAGTTGCT TGGGTTGCCG 
GATAAAGTGC GAAAGGCGCT GACCCAAAAG GTAAAGATTC TTGAGCGAGA CCCGATTTCA
GCACAAGGTG ATGCCAAAAA GCTCAAAAAT AGTAATCCGC CGCTGTATCG GGTCAGAATT
GGTGAGTATC GATTGATTTA CACCTTTGAC TCAGGATGGG TGCAGCTTAT TGCTATTCGT
AAACGGGATG ATCAGACGTA TCAACGCGAT TTCGGGCAGA TCGACCTTCC CGATCACGCC
CCACCGTCGT TGGTCGATCC CGATACGATT GATGCACTCC TTACAACCCC TACGCAGTCG
GCAACCCTTC CTCCTTTGGC CGGCGATTCG TCTTCCAGAC AGCCACAATT GCTCCCACGT
CGTTTAACCG CCGAGGATTT ACGTCATTGG CACATTACCG AGCAGTATTG GTCAAGCCTT
CTAGCCGTAC AAACCGAAGA AGACCTCCTC TCGGCTCCTG TACCACAGCA GATTATTGAG
CGGGTATTGG ATATTCTCTA CCCGCGCAAT ATCGACGACA TTTTGCATCA GCCACGTTTT
CTCCTCAACG AAGCGGAAGA TCTCGACCGG TTTGTTGCCG GTGAGCTGAC CGACTTTTTA
CTCATGCTCG ATCCCGACCA GCAACGCTTA GCCACTACCT CCATTGATCA CCCTGTACTG
GTGCGTGGTG GACCGGGAAC CGGTAAGTCT ATCATTGCGC TGTATCGGGT CAAGCATCTC
GTTGAGCAAG GTATCACGCC GATACTATTT ACAACGTTTA CGAATACATT GACAGCCTAT
TCAACCCAAC TGCTCGAACG CCTGCTTGGT AGCGATCCGG CCACAAAAGG GGTGGAAGTC
GCGACCGTCG ATAGCCTGAT CGTCCGTCAC TATACGCGCG CATACGGCCC ACCGAACTTT
GCCACCGATA ATGAGCAACG GGAAGCATTA GAACAGGCAC TCCTCAATAC ACCAATGCCC
GGAGTGAATG AACGTGACCG TCAACGTCGG CGTGAGGTAT TGCAGTTGCT CGGTATCGAT
TACCTGCGTG ATGAGTTCAT CCACGTTATT GAGCACTGGG GATTGACCGA TCAGGTAGCT
TACCTGACTC AATCACGCAC AGGACGGCGC TTACCGTTGC GTCCACCGAT CCGCGAGGCA
GTCTGGGCGG TGTATGAGAC GTGGCGTGCT ATCTTGGCCC AGACCGGAAA GACCACGTGG
GGGTATGTGC GGCAACAAGC ACTGGCGTTA GCCCGGCAGC AAGCCACGCA CCCATACCGT
GCGATTGTGA TCGATGAAGC ACAAGATTTA TCACCGGTCG CGCTGCGCTA CCTATTGGCC
TGTGTCGAGT CACCCACTGC GATCTATCTA ACGGCAGATG CTGCGCAGTC GTTGTACCAA
CGCGGTTTTG CGTGGCAACA GATCGATACC ATGTTGCAAG TACGCGGACG GAGCTATGTC
TTGCGTCGCA ATTATCGTAA TACGGCCCAA ATTGCAACGG CGTGCGCTGC TATTCTCGGT
CAAAGCAACG ACCAACCAGC GATCGATGCC GTCCACGAAG GTCCCCGTCC GGTCGTGCGA
TTTTGCCGCT CCGTTAGCGA AGAGGCTATC GCCATCCGCG ATTTCTTGAT GACTGCAGCC
CGCCGGTGGC GCTTGCCCAT TCACGCCGGT GCCGTACTCT GTCATAGCCG GCAGGTAGCC
CAGCAGATCT GTGATGAATT ACAAAAGATC GGGTTGTCGG CAGAACTTAC CGAAAAGAAG
CAGATCGATC TCCGCAAGCC GGTGGTGAAG GTAATGACCA TCCATTCGGC CAAGGGGCTT
GAGTTTCCAT TTGTGGCGGT GCTGCGGTTA GCGCAAGACC ATTTGCCGCA TAGTTTCGAG
CATCTACCAC CCGAAGAACG TGACGAGATG ATCGAACAGG AGCGTCGCCT GTTCTATGTC
GGATGTAGCC GTGCAATGCG CGCACTCCTG GTATGCGCCG ATGCCGATCA ACCGTCACCG
TTTGTGCTCG ATTTGCCGAA TGAACTTTGG CAGTGGGAAG AAAAATAG
 
Protein sequence
MSKYHLITPP AFLSELLGLP DKVRKALTQK VKILERDPIS AQGDAKKLKN SNPPLYRVRI 
GEYRLIYTFD SGWVQLIAIR KRDDQTYQRD FGQIDLPDHA PPSLVDPDTI DALLTTPTQS
ATLPPLAGDS SSRQPQLLPR RLTAEDLRHW HITEQYWSSL LAVQTEEDLL SAPVPQQIIE
RVLDILYPRN IDDILHQPRF LLNEAEDLDR FVAGELTDFL LMLDPDQQRL ATTSIDHPVL
VRGGPGTGKS IIALYRVKHL VEQGITPILF TTFTNTLTAY STQLLERLLG SDPATKGVEV
ATVDSLIVRH YTRAYGPPNF ATDNEQREAL EQALLNTPMP GVNERDRQRR REVLQLLGID
YLRDEFIHVI EHWGLTDQVA YLTQSRTGRR LPLRPPIREA VWAVYETWRA ILAQTGKTTW
GYVRQQALAL ARQQATHPYR AIVIDEAQDL SPVALRYLLA CVESPTAIYL TADAAQSLYQ
RGFAWQQIDT MLQVRGRSYV LRRNYRNTAQ IATACAAILG QSNDQPAIDA VHEGPRPVVR
FCRSVSEEAI AIRDFLMTAA RRWRLPIHAG AVLCHSRQVA QQICDELQKI GLSAELTEKK
QIDLRKPVVK VMTIHSAKGL EFPFVAVLRL AQDHLPHSFE HLPPEERDEM IEQERRLFYV
GCSRAMRALL VCADADQPSP FVLDLPNELW QWEEK