Gene Cagg_1741 details

Gene Information

Locus tag: Cagg_1741
Symbol:
ID: 7269447
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2130321
End bp: 2131385
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 60%
IMG OID: 643566583
Product: putative transposase IS891/IS1136/IS1341 family
Protein accession: YP_002463078
Protein GI: 219848645
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
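
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the coding sequence ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Protein length in aa, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2130321, 2131385)
print(length)                  # 1065
print(protein_length(length))  # 354
```

Both results agree with the Gene Length and Protein Length fields of this record.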


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACGT TCGTCTCCAA GTTGCGTCCG ACACCTGCTC AAGTGGCTTG TCTTTCTGAG 
ACGGTGGAAA CCTGCCGCCA ACGCTATAAT CACGCTCTAA GCGAGCGCAA GACCGCCTAT
CGGGAGCGTG GCGAGTCCAT CGGCTTTGCG CGCCAATGCG CCAGCCTGCC TATGCTGAAA
CGGGAGGTAC CGCATCTGCA GCGTGTCCAC TCCCAAGTGG TGCAGGATGT CGTGCGTCGA
GGAGACCGCG CGTTTCAAGC GTTCGTTCGG CGGGTGAACG CCGGTGAAAA GGCGGGGTAT
CCGCGCTGCA AAGGGCGGGG CCGGTACGAT AGCTTCACCT ATCCCCGGTG GGGCAACGGC
GTCAAGCGGG AGCAGGGACG GCTCGTTCTC TCCAAAATCG GCGCTCTCCG GCTGCACAAC
GATCGCCCGG TTGAGGGTAC GCCAAACATC TGTATCATCG TTCGCAACGC GGATGGATGG
TACGCACACA TCGTGTGTGC TGTTGCACCA TCACCACTCC CGCCACCTGG CAGGTCGGTA
GGGATTGATG TTGGACGTGA GTCGTTTGCA ACGCTGTCGA ACGGCGTGCA GATTGCCAAC
CCGCGCTCCT ATCGCACCGC CGAACGTAAG CTGAAACAAG CACAACGGCG GCTTTCTCGT
CGCGTGAAGG GCAGCAATCG CTCCCGTAAG GCACGCACGT TGCTTGCGAA CGCTCACCTG
AAGGTCAAGC GGGCGCGACG GGACTTTGCT CACACAATCG CCCGCGCACC GGTCAATGAG
GATGACCATA TTGTGGTTGA GAAACTGAAC ATTCGGGGGA TGGTACGGAA CCATCCCCTT
GCCAAATCGA TCTCCGACGC CGGATGGGGC ATCGTTCTGA ATATCCTGCT CGCCAAGGCT
GCACGTGCTG GGCGAGTCGT GTGGAAGTCA ACCCTGCCGG AACGTCGCAC ATATGCGCGC
GCTGTGGCGA GTCCGTTCCC AAACGGCTTG CCGTTCGCTG GCACTCCTGC CCGTATTGTG
GTTGTGAATT GCACCGCGAT CATAATGCTG CGCTTAACAT CCTAA
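
The gene sequence above is printed in space-separated blocks of 10 bases, so the whitespace has to be stripped before any analysis. The following is a minimal sketch of that cleanup together with a GC-content calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from this record.
first_line = "ATGAAAACGT TCGTCTCCAA GTTGCGTCCG ACACCTGCTC AAGTGGCTTG TCTTTCTGAG"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # 50.0
```

Applying the same two functions to all 18 lines gives a value to compare with the GC content field of the record.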
 
Protein sequence
MKTFVSKLRP TPAQVACLSE TVETCRQRYN HALSERKTAY RERGESIGFA RQCASLPMLK 
REVPHLQRVH SQVVQDVVRR GDRAFQAFVR RVNAGEKAGY PRCKGRGRYD SFTYPRWGNG
VKREQGRLVL SKIGALRLHN DRPVEGTPNI CIIVRNADGW YAHIVCAVAP SPLPPPGRSV
GIDVGRESFA TLSNGVQIAN PRSYRTAERK LKQAQRRLSR RVKGSNRSRK ARTLLANAHL
KVKRARRDFA HTIARAPVNE DDHIVVEKLN IRGMVRNHPL AKSISDAGWG IVLNILLAKA
ARAGRVVWKS TLPERRTYAR AVASPFPNGL PFAGTPARIV VVNCTAIIML RLTS
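
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the NCBI base ordering (T, C, A, G). Translation table 11 assigns the same amino acids to elongation codons as the standard code, and differs only in which codons may act as starts. This sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The function name `translate` is illustrative. For real work an established library would be a safer choice.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string (block spacing allowed) until the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence from this record.
first_line = "ATGAAAACGT TCGTCTCCAA GTTGCGTCCG ACACCTGCTC AAGTGGCTTG TCTTTCTGAG"
last_line = "GTTGTGAATT GCACCGCGAT CATAATGCTG CGCTTAACAT CCTAA"
print(translate(first_line))  # MKTFVSKLRPTPAQVACLSE
print(translate(last_line))   # VVNCTAIIMLRLTS
```

The two outputs match the first 20 and the last 14 residues of the protein sequence listed above. The last line can be translated on its own because the 1020 bases before it are a multiple of 3, so it starts in frame.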