Gene Caci_3494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3494 
Symbol 
ID8334847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3892342 
End bp3893670 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content67% 
IMG OID644956638 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_003114241 
Protein GI256392677 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0376392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.207887 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGATC ATGCCGAGGT GGTGGAGCTC GCCGTCGAGG TCGAGCAGGT GGGGCGGGTC 
GCGGCGATCG ATGTGGCCAA GGCCTCCGGG ATGGTGTGCA CCCGGCTGCC GTCGGAGACG
AAGGCGCAGC GCCGGGTGCA GAGGGTTTGG GCGGTGGCGG CGACCACCGA CGAGATCACA
GCTCTGGGTG ATCATCTGGT GTGTCAAGGT GTGGAGTTGG TGGTGATGGA GGCCACCGGG
GTGTTTTGGA GACCGTGGTT CTATCTGCTG GAGGATCGGG GCCTGCGGGT GTGGCTGGTG
AACGCCCGGG ATGTGAAGAA CGTGCCCAGC CGTCCGAAAA CCGACAAGCT GGATGCGATC
TGGCTGGCGA AGCTGGCCGA GCGGGGGATG TTGCGGGCCT CGTTCGTGCC GCCGGAGCCG
GTGCGCAGGC TGCGGGATCT GACCAGGCTG CGCCGCACCC TGGTGGAGGA GCGCACCCGG
TATCGGCAGC GGGTCGCCGA TGTGTTGCAG GACGCGTGTT TGAAGGTCGC CGACCCTAAA
CAGGGCCTGA CCGACCTGTT CGGCATGTCC GGGCGGGCGA TCCTGGCCGC GCTGGTCGCC
GGGCAGCGCG ACCCAAAGGC CTTGGCCGCC TTGGCGATGG GGCGGGCTGT GGTGAAGACC
GCATACTTGG AGAAGGCCCT GGCCGGCCGG TTCACCGCCC ACCACGGCTT CCTCGTGGGC
AAGCTGCTCG ATCTGCACGA CAGGCTGGAG AACGACATCG CCGAGCTGAA CGCCCGGATC
GAAGCCATGA TCGCCGAACT GGACCGGACT CCGCCACCGG ATGACAACCA CCCAGACCGG
CTACCCCTGT TGGACCGACT CGACGAGATC CCCGGAGTCT CCCGGGAGAT CGCCGCCGAC
ATCCTCGGCG AGACCGGGTT CGACATGACC GTCTTCCCCA CCGGCGGGCA CCTGGCCTCC
TGGGCCAAAC TCACCCCACG CACCATCCAA TCCGGCGCCA GAAACAGCCA CGGCGGCACC
GGTAAAGGCA ACCGCTGGAT CAAAGGACCG CTCGGACAGG CCGCGCAGGC CGCCGGCCGC
ACCAAAACCT TCCTCGGGGC ACGCTACAAA CGCATCGTCA AACACGCCCC GGCGAAGAAG
GCCCAGGTCG CCGTAGCCCG CAACATCCTG GAGATCGCCT GGGTGCTCAT CAACGACCCC
GACGCCCGCT TCACCGACCT CGGCCCCGAC TGGCACACCC GGCGAACCGA TGAAACCCGC
AAAACCCGAC AGCACATCCG CGAACTCGAA CACCTCGGCT ACACCGTCAC CCTGACCAAA
GCAGCCTGA
 
Protein sequence
MDDHAEVVEL AVEVEQVGRV AAIDVAKASG MVCTRLPSET KAQRRVQRVW AVAATTDEIT 
ALGDHLVCQG VELVVMEATG VFWRPWFYLL EDRGLRVWLV NARDVKNVPS RPKTDKLDAI
WLAKLAERGM LRASFVPPEP VRRLRDLTRL RRTLVEERTR YRQRVADVLQ DACLKVADPK
QGLTDLFGMS GRAILAALVA GQRDPKALAA LAMGRAVVKT AYLEKALAGR FTAHHGFLVG
KLLDLHDRLE NDIAELNARI EAMIAELDRT PPPDDNHPDR LPLLDRLDEI PGVSREIAAD
ILGETGFDMT VFPTGGHLAS WAKLTPRTIQ SGARNSHGGT GKGNRWIKGP LGQAAQAAGR
TKTFLGARYK RIVKHAPAKK AQVAVARNIL EIAWVLINDP DARFTDLGPD WHTRRTDETR
KTRQHIRELE HLGYTVTLTK AA