Gene Cagg_0916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0916 
Symbol 
ID7267989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1143119 
End bp1144312 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content58% 
IMG OID643565764 
Productexodeoxyribonuclease VII, large subunit 
Protein accessionYP_002462270 
Protein GI219847837 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0119351 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGTAT TAACCGTTTC TGATTTAAAC AGCGCCCTCC GCGCTCATCT TGAGGGCGAG 
GGCTTGTTTT TCGACCTTTG GTTGCTCGCC GAAGTAGTCG AGTTCCGCCG TTATCCGTCG
GGTCACTGCT ACTTCACGCT CAAGGACGAG CAAGCGAGCA TTCGAGCTGT CCTTTGGCGG
GGTGTTGCCG AACGTATCGC AACCTTACCG ACGAACGGCG ATGCTGTATT GGTGCATGGG
CGGGTTGGTT TCTACGAAGC CCGCGGTGAG CTACAGTTTG TTGTCGATCA GATTGTACCG
GCCGGGGTAG GACTGCTCAA CGCACAGCTT GCCCAACTAC GCGCCCGCCT CGAAGCTGAA
GGTTTGTTCG ATGAACGCCG CAAACGCCCG TTGCCGCCCC TGCCACGACG AATCGGCATT
GTTACCTCAC TGCAAGCGGC TGCTCTGCAA GATATGCTGA CCATTTTACG CCGTCGTTAC
CCGCTTGCGG AAGTTTTGTT GTCCCCCTGT CTGGTACAGG GCGAATTAGC CCCGGCCAGT
ATCGTTGCTG CCCTGCGCCG CGTCTACACC GAGGCGGTCG ATCTGGTGAT CTTAGCTCGT
GGTGGTGGCG CAAGTGAGGA TTTGGCGGCT TTTAACGATG AACAGGTTGT ACGAACGGTG
GCAGCTAGCC CCGTTCCGAT TATTACCGGC GTTGGTCACG AAACCGATAC GACGTTAGTT
GATGCTGTGG CCGATCTGCG TGCGCCTACT CCCTCTGCCG CCGCCGAAAT GGCCGCACCA
CCGCTGACCG AACTGCGTCA GCGGGTGCTT GCATTACACG AACGTGCAAC TGTCGCTATC
ATCGATCGTC TGCATCGGCA ACGTCAGATG GTTGCTCAAC AGCATGCTCT TCTCCAACGA
AATCATCCAC ACCGAACGAT CGAGGCCGCA CGCCAAACAG TTGATGACCT ATCACGCCGA
GCCGGACGAG CGTTCGGAAG ATGGCTACAA CTCGAACAAA CACGTCTGCA AGGCCTACAA
GCTCGCCTTG CGACACTTAG CCCACAGGCA ACGCTTGCCC GTGGGTATGC CATCGCCCAA
CAGGTTGATG GTCACGTGGT GACCGACCCG GCGCAAGTAC AGGCCGGTGA AGCCCTCACC
CTAACCGTCC GCGCCGGCCG CGTGCGCGTT ATTGTGGAGC ACACCGATGA GTGA
 
Protein sequence
MHVLTVSDLN SALRAHLEGE GLFFDLWLLA EVVEFRRYPS GHCYFTLKDE QASIRAVLWR 
GVAERIATLP TNGDAVLVHG RVGFYEARGE LQFVVDQIVP AGVGLLNAQL AQLRARLEAE
GLFDERRKRP LPPLPRRIGI VTSLQAAALQ DMLTILRRRY PLAEVLLSPC LVQGELAPAS
IVAALRRVYT EAVDLVILAR GGGASEDLAA FNDEQVVRTV AASPVPIITG VGHETDTTLV
DAVADLRAPT PSAAAEMAAP PLTELRQRVL ALHERATVAI IDRLHRQRQM VAQQHALLQR
NHPHRTIEAA RQTVDDLSRR AGRAFGRWLQ LEQTRLQGLQ ARLATLSPQA TLARGYAIAQ
QVDGHVVTDP AQVQAGEALT LTVRAGRVRV IVEHTDE