Gene Caci_4167 details


Gene Information

Locus tag: Caci_4167
Symbol:
ID: 8335521
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4716129
End bp: 4719623
Gene Length: 3495 bp
Protein Length: 1164 aa
Translation table: 11
GC content: 68%
IMG OID: 644957270
Product: Ig domain protein group 2 domain protein
Protein accession: YP_003114872
Protein GI: 256393308
COG category:
COG ID:
TIGRFAM ID:
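
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons minus one, assuming an unspliced CDS
    that ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(4716129, 4719623)
print(length, expected_protein_length_aa(length))  # prints: 3495 1164
```

Under these assumptions the result agrees with the Gene Length (3495 bp) and Protein Length (1164 aa) fields.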


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.608426
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0215824
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGTTC GGAGAAGAAG GCGGCTCGCG GTGTCGGCCA CCGTGGTCGC GGCGCTGGTG 
TGCGGCGCGC TGGCCGGTAC CGGAGGCGCG AGCGCGCAGA CCAGCGGTAC CGACGACACG
GCAGCACATA GCGGCGGCGG AGACGGTCGG CCCTGGCTGC CGCCGACGCC GGATCAGTGG
CCGTTGGTGG TCACCGCCTC GAACACCGCA CAGCAGGAGA TCACCCGCGG TATCGACTAC
CAGACCGACA CCTACCAGAC GGTCGGCGGT GTGCAGCACT CCACGGAGCT GAACGTCGAC
CTGTCCGACC CGAACGTCCG CCTGGGCGTC GTGGAGTCCC ACAATGAGAT CAACGACGCC
GCCGACGAGG TCCCCTCCTC GATGGCGAAC CGCACCGGCG CGGTCGCGGG CATCAACGGC
GACTTCTTCG ACATCTACGG CTCCGGCAGC CCGCACGGCA TGGTCGTCAT CGACGGCCGG
CTGGTGAAGA GCCCGAACCC GGCGTGGAAC CAGAACGTGG TCGTGCGCGC CGACGGCTCG
ATCGGCATGG GCGCCGAGGC GTATTCGGGC ACTGCGACGG ACGGCGCCGC GAGCCACCCG
ATCACCTCGG TCAACACTGT CGCGGACCTG TCCGCCAACG GCCTGGTGCG CATCACACCA
GACCTCGGCG ACAGCGGCAA GATCCCGGCG TCGGTCGTCG CCACCGGACA CCGCGACCCG
GCCGATGCCA GTGTCCTGAT CATCGACGCG GTGACGCCGA ACGTCACGGA CATCACCCAG
GTACCCGCCG GTACCGAAGA CCTGGTCGGC TCAGGAACCG CCGGCCAATG GCTGACAGCG
ACAGCGCACC CCGGCGACCG CGTGACGATC GCGGAGTCGA TCAGCCCTGA CAACGCCCCG
CGCCAGGCGC TGTCCGGCGG CGCCATCCTC GTGCAGAACG GCACGATGGC CGTGCCGGTG
CAGGGCAGCG GCGAGAACAA CGTCAACAAC CCGGTCACCG GTATGGGCGT GACCAAGGAC
GGCAAGCACG CGATCGTCGC AGTGTTCGAC GGCCACCAGC CGGAGGACGC CGCCGAGGGT
CTGACCCGGC CGCAGCTCGC CGGCTGGATG ATCGCGCACG GCGCCTACAA CGCCATGGTC
TTCGACTCCG GCGGCTCCAG CGAGATGGTC GCGCGCCAGC CGGGGCAGCA GCAGGTCAGC
GTCAGCAACA CGCCCTCGGA CGGCCACGAG CGCCCCGTCG CCAACGGACT CTTCTTCTAC
AGCACCGAAC CGCATCCCGC TCCCGCAGTC CGCGCCGTCG CGAATTCCGG CGCCCCGCTG
GCGGTTCTGA CGAACAGCAC CGTCCCGGTC GGGGCATATG CGGTAGACAC GCTCGGCAAT
CCCGCGAGCG ATCCGGTGAC GCTGTCAGTG CATCCGTCTT CAAGGGCGAG TATCAGTACC
GGAACAAACG GCGCGACCCT GACGGCGTCC GGCACGCCCG GCACCGGCGA ACTGGTCGCC
ACAGCTGGCC GCGCTCATTC CTCCGTGCCG CTGCGCGTGA CCGACCACCT CTCCTCGCTC
ACCCTCAGCC CAGCCACCGC CGACCTGAAC AACGGCGGAA CTCAGCAGCT CTCCGTCAGC
GCGACGACGC GGGACAAGCA GCCGGTCTCG CTGCTGCCCG CGTCGGTGGC TTGGACCGCC
TCCCCGCCGA ACCTGGGAAG CGTCGATCCC ACAACGGGAC TGTTCACCGC GGCGACCGAC
GACGAAGGCT TGGTGACCGT CACCGCCAGC GTGGACGGCG CGAGCGCGAC CACCTCCATC
GCGGTGGGTC AGCGCACTGA AATGGTCGAC ACGATGACCG ACGTGAACAA CTGGGCGGTG
AACACCCACG GCGGCGCGAC CGGCAGCCTC TCGCTGTCCA CGACAACCAA ACGGCTGCCC
ACCGACGCCG GCTCGATGGA CGTCAAGTAC GACATCCCGG CGGGCAGCGG CGTCAAGCAG
GTCGTGTTCT CCCCCACGGT GAGCGAGTCG TTCCCGCCGG CCGGGGAGAC GCAGCTACCC
GATGGCGTCG GGATCTGGAT CAAGGGATCG GGCACCGGCG GCTCCGGAAC TCCGCTGGGT
CTGGGCAACC TCACGCTCGC CGAGGCCTAC ACCGAGGTGA ACGGCCAATA CGTCGACTTC
TACCCGTCCA CGGTGACCTA CGACGGCTGG CAGCTCATCG TCGCCAACCT GCCTGCCGGA
TTGCAGTTCC CGATGAGCGT CAAGTTCCTG GACTTCCTGG TCATCAGCCC GACCCAGACC
CTGTCCGGTG ACCTGTATGT CAGTGACCTG CAAGCCCTGT ACTCGCCGCG ACCGCTCGTG
ACGCCGCCAT ACGTGGCGAT CCCGGACAAT CCGTCGTGGC TTCAGTTCAC CGAGGACCCG
GCGAAATTCC GTGCCGGCGG CACCACCTTG GCAGCACTCG ATGACGCCCA CACGCACGCC
GACGACCCGA ACTCCACCGG TAGCGTCGTA CTCAAGCAGG ACGGCACACA GATCAAGGCT
CTGCCATCAA GCCAGACCGG AGCGCTGTCG CTGCAGACCA TGGGCGACAT GAGCGACACC
GGCAGCACCG CCAACCTGAC GTATCTGAAA TCGCTGCTGG ACGGTACCGG CGTGCCGTAC
CACGAAGGCG TCGGCAACCA CGAGATCACC CAGGGTGCGG ACCCGGAGAA CAAGAACTGG
ACCAGCCTGT TCGGCGCGAC GCACTACAGC TACACCCAGG GTGCCGCGAA CATCCTGGTG
ACCGACAGCT CCCACATCGG CATTCTGCCC TCGGACCCGT ACCAGGTCCC GGCCGGCGAC
CCGCCGCAGT ACCAGTGGCT GGCCGACCAG CTGTCGGCGA ACCGCTCGCC GGTGGTCTTC
GTGGTCAGCC ACGTCCCCGC CTACGACCCG CATCCGCAGC AGGACAGCCA GTTCGCCGAC
CGCTGGGAGG CGCAGATGTT CGAGACGCTG GTCCAGAAGT ATCAGGACAC GCATCCGCAC
ACGCACGTCA TCACGCTGTT CGGCCACGCG CGCGGCTGGG CCGAGAACCT GCTCGACCCG
ACCGGCCACA ACACAGCCGG CGGCATCCCG AACTTCGTCG TCGCCGACGC GGGTGTCGAG
GCGTACGCGC CGCCAGCCGA GGGCGGGTTC TACAACTACG GGCTCTTCCA TGTGCTGCCC
AACGGAGACG TGCAGTTCGC GGCGATCCCG ACGCTGGCCG GCATCACGGT GAGCAGCCCG
GCGAGCACGC TGCAGGCAGG CCAGTCGACG CAGCTGACCG CGACCGGAAC CACGCCAACC
GGCGACGATC TGCCCGCCCT CAGCGTGCCG ATCGCCGATC CGGCCTCGCA TGTGTGGCGC
AGCTCCGATC CGCACGTGGC GACGGTCGAC CCGGTCACCG GCGCCGTGCA CGCGTGGCAC
GCGGGGACGG CGACCGTCAC GGTGACGTCG GACGGGATCA GCGGTTCGGT GACGTTGACG
GTCACGAAGG GTTGA
 
Protein sequence
MMVRRRRRLA VSATVVAALV CGALAGTGGA SAQTSGTDDT AAHSGGGDGR PWLPPTPDQW 
PLVVTASNTA QQEITRGIDY QTDTYQTVGG VQHSTELNVD LSDPNVRLGV VESHNEINDA
ADEVPSSMAN RTGAVAGING DFFDIYGSGS PHGMVVIDGR LVKSPNPAWN QNVVVRADGS
IGMGAEAYSG TATDGAASHP ITSVNTVADL SANGLVRITP DLGDSGKIPA SVVATGHRDP
ADASVLIIDA VTPNVTDITQ VPAGTEDLVG SGTAGQWLTA TAHPGDRVTI AESISPDNAP
RQALSGGAIL VQNGTMAVPV QGSGENNVNN PVTGMGVTKD GKHAIVAVFD GHQPEDAAEG
LTRPQLAGWM IAHGAYNAMV FDSGGSSEMV ARQPGQQQVS VSNTPSDGHE RPVANGLFFY
STEPHPAPAV RAVANSGAPL AVLTNSTVPV GAYAVDTLGN PASDPVTLSV HPSSRASIST
GTNGATLTAS GTPGTGELVA TAGRAHSSVP LRVTDHLSSL TLSPATADLN NGGTQQLSVS
ATTRDKQPVS LLPASVAWTA SPPNLGSVDP TTGLFTAATD DEGLVTVTAS VDGASATTSI
AVGQRTEMVD TMTDVNNWAV NTHGGATGSL SLSTTTKRLP TDAGSMDVKY DIPAGSGVKQ
VVFSPTVSES FPPAGETQLP DGVGIWIKGS GTGGSGTPLG LGNLTLAEAY TEVNGQYVDF
YPSTVTYDGW QLIVANLPAG LQFPMSVKFL DFLVISPTQT LSGDLYVSDL QALYSPRPLV
TPPYVAIPDN PSWLQFTEDP AKFRAGGTTL AALDDAHTHA DDPNSTGSVV LKQDGTQIKA
LPSSQTGALS LQTMGDMSDT GSTANLTYLK SLLDGTGVPY HEGVGNHEIT QGADPENKNW
TSLFGATHYS YTQGAANILV TDSSHIGILP SDPYQVPAGD PPQYQWLADQ LSANRSPVVF
VVSHVPAYDP HPQQDSQFAD RWEAQMFETL VQKYQDTHPH THVITLFGHA RGWAENLLDP
TGHNTAGGIP NFVVADAGVE AYAPPAEGGF YNYGLFHVLP NGDVQFAAIP TLAGITVSSP
ASTLQAGQST QLTATGTTPT GDDLPALSVP IADPASHVWR SSDPHVATVD PVTGAVHAWH
AGTATVTVTS DGISGSVTLT VTKG
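
The gene and protein sequences above are printed in blocks of ten characters. As a minimal sketch of how the two relate, the code below joins the blocks and translates codons using the amino-acid assignments of the standard code, which translation table 11 shares. The helper names `join_blocks` and `translate` are hypothetical and not part of any database tool. Alternative start codons permitted by table 11 are not special-cased, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same assignments and differs only in permitted start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string codon by codon, ignoring a trailing
    partial codon. With to_stop=True, stop before the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed in this record.
first_line = "ATGATGGTTC GGAGAAGAAG GCGGCTCGCG GTGTCGGCCA CCGTGGTCGC GGCGCTGGTG"
last_line = "GTCACGAAGG GTTGA"
print(translate(join_blocks(first_line)))  # prints: MMVRRRRRLAVSATVVAALV
print(translate(join_blocks(last_line)))   # prints: VTKG
```

Both outputs match the start and end of the protein sequence listed above.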