Gene Cagg_1199 details


Gene Information

Locus tag: Cagg_1199
Symbol:
ID: 7267948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1477815
End bp: 1478957
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 60%
IMG OID: 643566042
Product: transposase, IS605 OrfB family
Protein accession: YP_002462544
Protein GI: 219848111
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
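
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information record above.
START_BP = 1477815
END_BP = 1478957
GENE_LENGTH_BP = 1143
PROTEIN_LENGTH_AA = 380


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1143
print(coding_codons(GENE_LENGTH_BP))   # 380
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.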


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000673457
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00797435
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGACGT TTGTCTCCAA GTTACGTCCG ACACCTGCTC AAGTGGCTTG TCTTTCTGAG 
ACGGTGGAAA CTTGCCGCCA ACGCTACAAT CACGCTCTGA GCGAGCGCAA GACCGCCTAT
CAGGAACGTG GCGAGTCCAT CGGTTTTGCG CGCCAATGCG CCAGCCTGCC TATGCTGAAA
CGGGAGGTGC CGCATCTGCA GCGTGTCCAC TCCCAAGTGG TGCAGGATGT CGTGCGTCGA
GGAGACCGCG CGTTTCAAGC GTTTTTTCGG CGGGTGAACG CCGGCGAAAA GGCGGGGTAT
CCGCGCTGCA AAGGGCGGGA CCGGTACGAT AGCTTCACCT ATCCCCGGTG GGGCAACGGC
GTCAAGCGGG AGCAGGGACG GCTCGTCCTC TCCAAAATCG GCGCTCTCCG GCTGCACAAC
GATCGCCCGG TTGAGGGCAC GCCAAACATC TGTATCATCG TCCGCAACGC GGATGGATGG
TATGCACACA TCGTGTGTGA CGTTGCACCA TCACCACTCC CGCCAACCGG TAAGTCGGTA
GGGATTGATG TTGGGCTTGA GTCGTTTGCA ACGCTATCGA ACGGCGTACA GATCGCTAAT
CCGCGCTCCT ATCGTGTCGC CGAACGCACG CTGAAACAAG CACAACGACG GCTCGCTCGT
CGCGTGAAGG GCAGCAATCG CTCCCGTAAG GCACGCACGT TGCTTGCGAA CGCTCACCTG
AAGGTCAAGC GGGCGCGATG GGATTTCGCC CACACAATCG CCCGCGCACC GGTCAATGAG
GATGACCATC TTGCGGTTGA GAAACTGAAC ATTCGGGGGA TGGTACGGAA CCATCCCCTT
GCCAAATCGA TCTCCGACGC CGGATGGGGC ATCTTTCTAA ATATCCTGCT CGCCAAGGCT
GCACGTGCTG GGCGAGTCGT GGTGGCAGTC AACCCTGCCG GAACGTCGCA TATATGCGCG
CGCTGTGGCG AGTCCGTTCC CAAACGACGT GCCGTTCACT GGCACTCCTG CCCGTATTGT
GGTTGTGAAT TGCACCGCGA TCATAATGCT GCGCTTACTA TCCTAAAGAA GTGCGGGGGC
GCCGCCTTCG GGGAGGCTCA GCCGTTAGGC GGGCCGCAGA ACCGAGAACC TCACAGGCTT
TAG
 
Protein sequence
MKTFVSKLRP TPAQVACLSE TVETCRQRYN HALSERKTAY QERGESIGFA RQCASLPMLK 
REVPHLQRVH SQVVQDVVRR GDRAFQAFFR RVNAGEKAGY PRCKGRDRYD SFTYPRWGNG
VKREQGRLVL SKIGALRLHN DRPVEGTPNI CIIVRNADGW YAHIVCDVAP SPLPPTGKSV
GIDVGLESFA TLSNGVQIAN PRSYRVAERT LKQAQRRLAR RVKGSNRSRK ARTLLANAHL
KVKRARWDFA HTIARAPVNE DDHLAVEKLN IRGMVRNHPL AKSISDAGWG IFLNILLAKA
ARAGRVVVAV NPAGTSHICA RCGESVPKRR AVHWHSCPYC GCELHRDHNA ALTILKKCGG
AAFGEAQPLG GPQNREPHRL
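
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. A minimal sketch of that step follows. It hard-codes the codon assignments of NCBI table 11 (which match the standard code for elongation codons) and does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments of NCBI translation table 11, listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_blocks = "ATGAAGACGT TTGTCTCCAA GTTACGTCCG"
print(translate(first_blocks))  # MKTFVSKLRP
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function is one way to compare the two sequence blocks end to end.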