Gene Caci_2073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2073 
Symbol 
ID8333417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2345573 
End bp2348716 
Gene Length3144 bp 
Protein Length1047 aa 
Translation table11 
GC content72% 
IMG OID644955222 
Producthypothetical protein 
Protein accessionYP_003112833 
Protein GI256391269 
COG category[S] Function unknown 
COG ID[COG4485] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGAA TTCGCGAGTC CAGCCCCGGC GAGCCCATCG TCGCGACAGC TGTCGCCCTG 
GAGCCGCGAC CCGCGGCCGC GGGTCGCGGC GGCGGCGACG CACGCGTCCG GCTCCGGTGG
TGGACCGCGC TGTGGCGCAC CCGGGTCTGG AAGCGCCGCC CGCGACCGCT GACCGTGGTC
GCGGTGGTGG GCGTCATGCT GTTCGCGCTC TGGGGCATCG GCGGGCCGCT GTTCGGCGCC
TCGACGCTCA CCCCGACCGA CGAGATGGTG ACCAACGGTC CGTGGGTGAG CGCCGGCTTC
GCCGGGACCG TGCCCTCGAA CACCTACCTG GACGACACCT ACACCTCCGA GCTGCCCAGC
GAGATCCTGT TCAAGCAGCA GCTGGGCCAC GGCAAGGTCG CGCAGTGGAA CCCGTACGGC
GCGGCCGGAA GCGCGCTCGG CGCCATTCCG GACTACGCGC TCTACTCGCC GCTGACCGTG
CCGTTCTATG TGCTGCCGAG CTGGCTCGCT CCCGCATATG AGCGGCTGCT GGAGATCGTG
TGCTCGGTCG GCGGGGCGTT CCTGTTCCTG CGCAGACTTT CGCTGTCGAG ACCGGCGGCG
CTGGTAGGCG GCCTCACCTT CGCCGGCAGC GGATTCATGG TCGCCTGGCT GGGCTTCCCG
CAGACGAGAG TGGCGGCGTT CATCCCGGCG CTGTTCTGGC TGGTCGAGCG GTTCATCCAG
GAGCGCCGGC CTCGGGACGC CGCGTTGGTG GCGCTGCCGG TCGCGGCGCT GTTCCTCGGC
GGGTTCCCGT CGGTCGCCGG GTACGCCTTG CTGACCGCCA CTGCTTACGC ACTGGTCCGG
CTCGCTGCCG AGCATCGTAC GAACCTGCGG CGGCTGGTGC GGCCGGTAGC GTATCTGGGC
GCCGGGCTGG CTGCCGGTGT CGGTCTGGTG CTGTTCCAGC TGGTGCCGTT CCTGCAGTTC
TTCGGGACGT GGCTGATCGC GGGCCGGAGC CAGACGGCGA CCGCCACGCT GCCGGTGTCC
AGCGCGCTGA CGATGGTCGC GCCGTGGGCG TACGGGTCGG TCGACTCCAA GGATCCGGTC
CAGTTCGTGT TGTCGACCAA CATGGTCGAG GCTGCCGCCT ACCTCGGTGC GGCGGCTGTG
GTGCTGGTGT TCGTGGCCAT CGCGATGCCG CGCCGGGGGC GGGGTCTGCT GCCCACCGGG
GCGTGGGTGT TCTTCATGGC CGCCACGGCG GTGTGGATCG AGCTGATCTA CGTCGGCGGG
GCGCCGCTGG ATCTGTTCCA GAAGCTGCCC GGGCTGCGGG CGTTGTTCGA GCAGAACTTC
ATCGGCAGGG CCAGGAGCAT CCTGGGGTTC CTGCTCGCGG TGCTGGTCGC GGTCGGTTTC
GAGGCGCTGG TCCGGGTGCG GGCTCAGGAG CCGAAGACGG CTTCGGGGTC GGGCTCGGTT
CCGGGGTCGG CGGGGCTGGC GGGGCTGGCG TCGGCGCGGG CGGCGAGGGC GAGCGCGGCG
ACGGCGACGG CGACGGCGGC GGCATCCGGC GGGTGGGGGG CTTGGGCGTC GCTCGTGCCG
TGGCGCTGGC CGAGGCGGTC GCTGTGGACC GCGGCGGTCG TGGTCGGCGG GATCGCCGCC
GCGGCGGCTC TGGTCGCCCA CGGCTGGAGC ACGGTGCACA GCGCTGCCGC GACCTCCGGT
CAGGACGTCG GCAAGGCGCT GGACCTGTAC GGCAGCCAGA TGGCACGCGC CGGGGTCATC
GTCGTGATCG CCGTGTCGTG CGTGATCGCG TTGTGCGTGG CGCGGCGTCA GGCGTTCGCC
GGCCGGCGCG AGGCGTCGGT GCTGCGATTC GGCGCGGCCT CCACCCTGAT CGTGCTGATG
GCTGTTCAGG GCGCGCAGTT CATGGAGGGT TACTACCCGA AGTCCACTAA GTCGATGTTC
TACCCGGTCA CCGACACCCA CACGTTCCTG GCGGACAACC TCGGCGAGCA GCGCTACGCC
TCGGCGTACG ACGGCATCAC ATTCGGGACC GCGACAGCGT ACGATCTGCG GTCGGTGAAC
GGGCACAACT TCCTGAACGC CGATTTCGCC GCGCTGATGC AGGGGATGCC GGACAGCGCG
GTTCCGTATC CGACGTACGT GGACTTCCAG GCGGGCGACG TGAAGCAGGC CACCAGTCCG
GTGCTGGACC GGCTGGGCAC CAAGTACTGG GTCGCCGGAC CGACCGACAA CGTGTTCGGC
ACGGTCGTCT CGGCGCCGCG CGGCGGGACC ACGCAGCTCG TTCCGGGCCG GCCGGTCACG
GTCCCGGTGC CGGCAGCCGG TCCGCTGCGC GGGATCTCCT TCACCCCGCA GGGCACGGTC
TCCAGCAGCA TCGCGGGGCT GACCAAGGAC ACCACGGTCG AGGTCGTGAT CCGCGACGCG
AGCGGCCGGC AGGTCGCCGC CGCCAACCGG CTGACCGGCG CCCGGGCCGG CGCGCCGTTC
CAGGTCGCGG TCGCCGCCGA TACGCTGCCC GCCGGCACGG CGCTGACCGC GACGATCACG
CTGCACGCCG ACGCGCCGCT GACCGTGGAC GCGAACCACG GGCTGCCGGC CGTCGACGCG
ATCACCGACG CCGACGACGG GCTGCGCGTG GCGTATGTAG GCTCCTCGGT GATTTACGAG
CGGCTGAACG CGTTGCCGCG CATCCGCTGG GCGTCACAGA GTACTGTCGT TCCCTCGCAG
GACCAGCGCG TCTCGATGCT GTCCTCCGGG GCGGTGGCGG ACAACGCCGT GGTGCTCTCC
GCACCGGGCC CGGCGGCGTC CGGGCAGCCG GCGGCGGTGC GGGTCCAGCA GGACGGCACA
GACACGATCA CGACCACGGT TGACGCCAAG GGGTCGGGGT ACCTTGTCGT GTCCGATGCC
GACCAGGTCG GCTGGCAGGC TACTGTGGAC GGCCGCCGGG CGGATCTGGT GAAGGCCGAT
CAGGGACTGG TCGCGGTGGA CGTGCCGGCC GGCACGCATT CCGTGACATT GCGGTACGAC
TTGCCACACC AGGCGGCCGC GACGTGGGCC TCCGGCGCCG TCGGGCTCTC GCTGATGGCG
GTACCGGCGG GGGAGTGGTG GTGGGAGCGT CGGCGGCGCC GTCCTGGCGC TCGCGACGCG
ATGGAGCGGG GACCGGAGGG ATGA
 
Protein sequence
MTGIRESSPG EPIVATAVAL EPRPAAAGRG GGDARVRLRW WTALWRTRVW KRRPRPLTVV 
AVVGVMLFAL WGIGGPLFGA STLTPTDEMV TNGPWVSAGF AGTVPSNTYL DDTYTSELPS
EILFKQQLGH GKVAQWNPYG AAGSALGAIP DYALYSPLTV PFYVLPSWLA PAYERLLEIV
CSVGGAFLFL RRLSLSRPAA LVGGLTFAGS GFMVAWLGFP QTRVAAFIPA LFWLVERFIQ
ERRPRDAALV ALPVAALFLG GFPSVAGYAL LTATAYALVR LAAEHRTNLR RLVRPVAYLG
AGLAAGVGLV LFQLVPFLQF FGTWLIAGRS QTATATLPVS SALTMVAPWA YGSVDSKDPV
QFVLSTNMVE AAAYLGAAAV VLVFVAIAMP RRGRGLLPTG AWVFFMAATA VWIELIYVGG
APLDLFQKLP GLRALFEQNF IGRARSILGF LLAVLVAVGF EALVRVRAQE PKTASGSGSV
PGSAGLAGLA SARAARASAA TATATAAASG GWGAWASLVP WRWPRRSLWT AAVVVGGIAA
AAALVAHGWS TVHSAAATSG QDVGKALDLY GSQMARAGVI VVIAVSCVIA LCVARRQAFA
GRREASVLRF GAASTLIVLM AVQGAQFMEG YYPKSTKSMF YPVTDTHTFL ADNLGEQRYA
SAYDGITFGT ATAYDLRSVN GHNFLNADFA ALMQGMPDSA VPYPTYVDFQ AGDVKQATSP
VLDRLGTKYW VAGPTDNVFG TVVSAPRGGT TQLVPGRPVT VPVPAAGPLR GISFTPQGTV
SSSIAGLTKD TTVEVVIRDA SGRQVAAANR LTGARAGAPF QVAVAADTLP AGTALTATIT
LHADAPLTVD ANHGLPAVDA ITDADDGLRV AYVGSSVIYE RLNALPRIRW ASQSTVVPSQ
DQRVSMLSSG AVADNAVVLS APGPAASGQP AAVRVQQDGT DTITTTVDAK GSGYLVVSDA
DQVGWQATVD GRRADLVKAD QGLVAVDVPA GTHSVTLRYD LPHQAAATWA SGAVGLSLMA
VPAGEWWWER RRRRPGARDA MERGPEG