Gene Cagg_1805 details


Gene Information

Locus tag: Cagg_1805
Symbol:
ID: 7267717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2212048
End bp: 2213319
Gene length: 1272 bp
Protein length: 423 aa
Translation table: 11
GC content: 53%
IMG OID: 643566644
Product: restriction modification system DNA specificity domain protein
Protein accession: YP_002463139
Protein GI: 219848706
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
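
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes that the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 2212048
end_bp = 2213319
reported_gene_length = 1272      # bp
reported_protein_length = 423    # aa

# Assumption: start and end are 1-based, inclusive coordinates.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported gene length of 1272 bp and protein length of 423 aa, and the gene length is a whole number of codons.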


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.745604
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.535567
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTAAGGT TGGGGGAGGT GGCAAGGCAG AGAAAAGGCT TTATTACCGT TGATGAGACG 
TTGGTTTACA AGCGTCCTAC AATCAAGCTT TATGGTCAGG GTATGGTTTT GCGCGATAAC
GTTATTGGAG CAAGCTTAAA GATAAAAAAG CAACAAGTTT GTAAGGCGTA TGATTTTGTT
GTTGCAGAAA TTGATGCAAA ATGTGGTGGA TTTGCAGTGG TTCCTCCCTT TTTAGAAGGG
GCAATTCTAA GTAGCCATTA TTTTATATTT GAACTGGACA AAGAAAAAGT TGACCCAAAT
TTCATGAGTT ACATTGTGAA GTTGCCTCTT CTTCAACGCC AAGTCGAAGC ACGGGGGTCG
ACGAATTACG CTTCTGTTAG ACCCTCGCAA GTGATTACAT ACTTAATCCC CCTCCCCCCG
CTCCCCGAGC AGCGCGCCAT TGCCCACGTG CTGCGGGCGG TGCAGCGGGC GCAGGAGGCG
AGCGAGCGGG TCATCGCCGC GTTGCGCGAA CTCAAAAAGA GCCTGATGCG CCACCTGTTC
ACCTACGGGC CGGTGGCCGT TTCCGTAGGG GCACAGCGCG CTGTAGGGGC ACAGCGCGCT
GTGCCCCTAC AGGACACCGA ACTCGGCCCC CTGCCCGCCC ACTGGCAGGT TGTGCGGTTG
GGGGAGGTGT GTCAGAAGTC CCCGCAGGTT GTTCCCACAA AGGCGCCGGA TTGGCAATTC
AAATATGTGG ATGTTTCATG TGTAGACAAT AGTTCATTGA ACATCGTGGA TTACCAAGTA
TTGACCGGTA AAGAGGCACC GAGTAGAGCG CGAAAACTGA TCAAAGCCGG AGATGTTATT
TTTGCTACGG TACGACCTTA TCTGAAGCGT ATAGCAATCG TGCCCCCTTC ACTCGATGGT
CAGGTATGCT CTACAGCTTT CTGTGTGCTT AGCCCAAAGC CTGAGGTTGA TGGCAGTTAT
CTGTTCTATG CAGTTTCGAC TGACGAGTTC GTGTCGAGTG TAGTAGAGTA TCAAAGAGGC
TCAAGTTATC CTGCGATAAC AGATAACGAT GTGAAACGTG GTTTCATCCC CCTCCCCCCG
CTCGCCGAGC AGCAGGAAAT CGCCCGCATC CTGCAGGCGG TGGATCGGCG GATTGAGGTG
GAGGAGGTGT CTGCGCGTGC GCTGGAGACG CTTTTCAAGA CCCTGCTGCA TGAGTTGATG
ACGGCGAAAC GGCGGTTGCC GCAGGAGTTC ATCGCCCGTT TCCAACAGGA GGAGATCCAT
GTGTCCGTAT GA
 
Protein sequence
MVRLGEVARQ RKGFITVDET LVYKRPTIKL YGQGMVLRDN VIGASLKIKK QQVCKAYDFV 
VAEIDAKCGG FAVVPPFLEG AILSSHYFIF ELDKEKVDPN FMSYIVKLPL LQRQVEARGS
TNYASVRPSQ VITYLIPLPP LPEQRAIAHV LRAVQRAQEA SERVIAALRE LKKSLMRHLF
TYGPVAVSVG AQRAVGAQRA VPLQDTELGP LPAHWQVVRL GEVCQKSPQV VPTKAPDWQF
KYVDVSCVDN SSLNIVDYQV LTGKEAPSRA RKLIKAGDVI FATVRPYLKR IAIVPPSLDG
QVCSTAFCVL SPKPEVDGSY LFYAVSTDEF VSSVVEYQRG SSYPAITDND VKRGFIPLPP
LAEQQEIARI LQAVDRRIEV EEVSARALET LFKTLLHELM TAKRRLPQEF IARFQQEEIH
VSV
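
The record lists translation table 11, and the gene sequence begins with GTG while the protein begins with M. The sketch below illustrates how such a coding sequence could be translated; it is not the tool that produced this record. It assumes that table 11 uses the same codon-to-amino-acid assignments as the standard code, that the listed alternative start codons (including GTG) are read as methionine at the first position, and that translation stops at the first stop codon. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering (third base varies fastest).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons accepted for table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, ignoring the spaces and line breaks used in the listing."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if not codons:
        return ""
    if codons[0] not in START_CODONS:
        raise ValueError("first codon is not a recognised start codon")
    protein = ["M"]  # the initiator codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGTAAGGT TGGGGGAGGT GGCAAGGCAG"))  # MVRLGEVARQ
```

The first ten residues produced this way match the start of the listed protein sequence. Passing the full gene sequence to the same function gives a translation that can be compared against the 423 aa protein sequence.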