Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_2073 |
Symbol | |
ID | 8333417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 2345573 |
End bp | 2348716 |
Gene Length | 3144 bp |
Protein Length | 1047 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644955222 |
Product | hypothetical protein |
Protein accession | YP_003112833 |
Protein GI | 256391269 |
COG category | [S] Function unknown |
COG ID | [COG4485] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGAA TTCGCGAGTC CAGCCCCGGC GAGCCCATCG TCGCGACAGC TGTCGCCCTG GAGCCGCGAC CCGCGGCCGC GGGTCGCGGC GGCGGCGACG CACGCGTCCG GCTCCGGTGG TGGACCGCGC TGTGGCGCAC CCGGGTCTGG AAGCGCCGCC CGCGACCGCT GACCGTGGTC GCGGTGGTGG GCGTCATGCT GTTCGCGCTC TGGGGCATCG GCGGGCCGCT GTTCGGCGCC TCGACGCTCA CCCCGACCGA CGAGATGGTG ACCAACGGTC CGTGGGTGAG CGCCGGCTTC GCCGGGACCG TGCCCTCGAA CACCTACCTG GACGACACCT ACACCTCCGA GCTGCCCAGC GAGATCCTGT TCAAGCAGCA GCTGGGCCAC GGCAAGGTCG CGCAGTGGAA CCCGTACGGC GCGGCCGGAA GCGCGCTCGG CGCCATTCCG GACTACGCGC TCTACTCGCC GCTGACCGTG CCGTTCTATG TGCTGCCGAG CTGGCTCGCT CCCGCATATG AGCGGCTGCT GGAGATCGTG TGCTCGGTCG GCGGGGCGTT CCTGTTCCTG CGCAGACTTT CGCTGTCGAG ACCGGCGGCG CTGGTAGGCG GCCTCACCTT CGCCGGCAGC GGATTCATGG TCGCCTGGCT GGGCTTCCCG CAGACGAGAG TGGCGGCGTT CATCCCGGCG CTGTTCTGGC TGGTCGAGCG GTTCATCCAG GAGCGCCGGC CTCGGGACGC CGCGTTGGTG GCGCTGCCGG TCGCGGCGCT GTTCCTCGGC GGGTTCCCGT CGGTCGCCGG GTACGCCTTG CTGACCGCCA CTGCTTACGC ACTGGTCCGG CTCGCTGCCG AGCATCGTAC GAACCTGCGG CGGCTGGTGC GGCCGGTAGC GTATCTGGGC GCCGGGCTGG CTGCCGGTGT CGGTCTGGTG CTGTTCCAGC TGGTGCCGTT CCTGCAGTTC TTCGGGACGT GGCTGATCGC GGGCCGGAGC CAGACGGCGA CCGCCACGCT GCCGGTGTCC AGCGCGCTGA CGATGGTCGC GCCGTGGGCG TACGGGTCGG TCGACTCCAA GGATCCGGTC CAGTTCGTGT TGTCGACCAA CATGGTCGAG GCTGCCGCCT ACCTCGGTGC GGCGGCTGTG GTGCTGGTGT TCGTGGCCAT CGCGATGCCG CGCCGGGGGC GGGGTCTGCT GCCCACCGGG GCGTGGGTGT TCTTCATGGC CGCCACGGCG GTGTGGATCG AGCTGATCTA CGTCGGCGGG GCGCCGCTGG ATCTGTTCCA GAAGCTGCCC GGGCTGCGGG CGTTGTTCGA GCAGAACTTC ATCGGCAGGG CCAGGAGCAT CCTGGGGTTC CTGCTCGCGG TGCTGGTCGC GGTCGGTTTC GAGGCGCTGG TCCGGGTGCG GGCTCAGGAG CCGAAGACGG CTTCGGGGTC GGGCTCGGTT CCGGGGTCGG CGGGGCTGGC GGGGCTGGCG TCGGCGCGGG CGGCGAGGGC GAGCGCGGCG ACGGCGACGG CGACGGCGGC GGCATCCGGC GGGTGGGGGG CTTGGGCGTC GCTCGTGCCG TGGCGCTGGC CGAGGCGGTC GCTGTGGACC GCGGCGGTCG TGGTCGGCGG GATCGCCGCC GCGGCGGCTC TGGTCGCCCA CGGCTGGAGC ACGGTGCACA GCGCTGCCGC GACCTCCGGT CAGGACGTCG GCAAGGCGCT GGACCTGTAC GGCAGCCAGA TGGCACGCGC CGGGGTCATC GTCGTGATCG CCGTGTCGTG CGTGATCGCG TTGTGCGTGG CGCGGCGTCA GGCGTTCGCC GGCCGGCGCG AGGCGTCGGT GCTGCGATTC GGCGCGGCCT CCACCCTGAT CGTGCTGATG GCTGTTCAGG GCGCGCAGTT CATGGAGGGT TACTACCCGA AGTCCACTAA GTCGATGTTC TACCCGGTCA CCGACACCCA CACGTTCCTG GCGGACAACC TCGGCGAGCA GCGCTACGCC TCGGCGTACG ACGGCATCAC ATTCGGGACC GCGACAGCGT ACGATCTGCG GTCGGTGAAC GGGCACAACT TCCTGAACGC CGATTTCGCC GCGCTGATGC AGGGGATGCC GGACAGCGCG GTTCCGTATC CGACGTACGT GGACTTCCAG GCGGGCGACG TGAAGCAGGC CACCAGTCCG GTGCTGGACC GGCTGGGCAC CAAGTACTGG GTCGCCGGAC CGACCGACAA CGTGTTCGGC ACGGTCGTCT CGGCGCCGCG CGGCGGGACC ACGCAGCTCG TTCCGGGCCG GCCGGTCACG GTCCCGGTGC CGGCAGCCGG TCCGCTGCGC GGGATCTCCT TCACCCCGCA GGGCACGGTC TCCAGCAGCA TCGCGGGGCT GACCAAGGAC ACCACGGTCG AGGTCGTGAT CCGCGACGCG AGCGGCCGGC AGGTCGCCGC CGCCAACCGG CTGACCGGCG CCCGGGCCGG CGCGCCGTTC CAGGTCGCGG TCGCCGCCGA TACGCTGCCC GCCGGCACGG CGCTGACCGC GACGATCACG CTGCACGCCG ACGCGCCGCT GACCGTGGAC GCGAACCACG GGCTGCCGGC CGTCGACGCG ATCACCGACG CCGACGACGG GCTGCGCGTG GCGTATGTAG GCTCCTCGGT GATTTACGAG CGGCTGAACG CGTTGCCGCG CATCCGCTGG GCGTCACAGA GTACTGTCGT TCCCTCGCAG GACCAGCGCG TCTCGATGCT GTCCTCCGGG GCGGTGGCGG ACAACGCCGT GGTGCTCTCC GCACCGGGCC CGGCGGCGTC CGGGCAGCCG GCGGCGGTGC GGGTCCAGCA GGACGGCACA GACACGATCA CGACCACGGT TGACGCCAAG GGGTCGGGGT ACCTTGTCGT GTCCGATGCC GACCAGGTCG GCTGGCAGGC TACTGTGGAC GGCCGCCGGG CGGATCTGGT GAAGGCCGAT CAGGGACTGG TCGCGGTGGA CGTGCCGGCC GGCACGCATT CCGTGACATT GCGGTACGAC TTGCCACACC AGGCGGCCGC GACGTGGGCC TCCGGCGCCG TCGGGCTCTC GCTGATGGCG GTACCGGCGG GGGAGTGGTG GTGGGAGCGT CGGCGGCGCC GTCCTGGCGC TCGCGACGCG ATGGAGCGGG GACCGGAGGG ATGA
|
Protein sequence | MTGIRESSPG EPIVATAVAL EPRPAAAGRG GGDARVRLRW WTALWRTRVW KRRPRPLTVV AVVGVMLFAL WGIGGPLFGA STLTPTDEMV TNGPWVSAGF AGTVPSNTYL DDTYTSELPS EILFKQQLGH GKVAQWNPYG AAGSALGAIP DYALYSPLTV PFYVLPSWLA PAYERLLEIV CSVGGAFLFL RRLSLSRPAA LVGGLTFAGS GFMVAWLGFP QTRVAAFIPA LFWLVERFIQ ERRPRDAALV ALPVAALFLG GFPSVAGYAL LTATAYALVR LAAEHRTNLR RLVRPVAYLG AGLAAGVGLV LFQLVPFLQF FGTWLIAGRS QTATATLPVS SALTMVAPWA YGSVDSKDPV QFVLSTNMVE AAAYLGAAAV VLVFVAIAMP RRGRGLLPTG AWVFFMAATA VWIELIYVGG APLDLFQKLP GLRALFEQNF IGRARSILGF LLAVLVAVGF EALVRVRAQE PKTASGSGSV PGSAGLAGLA SARAARASAA TATATAAASG GWGAWASLVP WRWPRRSLWT AAVVVGGIAA AAALVAHGWS TVHSAAATSG QDVGKALDLY GSQMARAGVI VVIAVSCVIA LCVARRQAFA GRREASVLRF GAASTLIVLM AVQGAQFMEG YYPKSTKSMF YPVTDTHTFL ADNLGEQRYA SAYDGITFGT ATAYDLRSVN GHNFLNADFA ALMQGMPDSA VPYPTYVDFQ AGDVKQATSP VLDRLGTKYW VAGPTDNVFG TVVSAPRGGT TQLVPGRPVT VPVPAAGPLR GISFTPQGTV SSSIAGLTKD TTVEVVIRDA SGRQVAAANR LTGARAGAPF QVAVAADTLP AGTALTATIT LHADAPLTVD ANHGLPAVDA ITDADDGLRV AYVGSSVIYE RLNALPRIRW ASQSTVVPSQ DQRVSMLSSG AVADNAVVLS APGPAASGQP AAVRVQQDGT DTITTTVDAK GSGYLVVSDA DQVGWQATVD GRRADLVKAD QGLVAVDVPA GTHSVTLRYD LPHQAAATWA SGAVGLSLMA VPAGEWWWER RRRRPGARDA MERGPEG
|
| |