Gene Information |
Locus tag | Caci_6626 |
Symbol | |
ID | 8337990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 7635217 |
End bp | 7638252 |
Gene Length | 3036 bp |
Protein Length | 1011 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644959720 |
Product | parallel beta-helix repeat protein |
Protein accession | YP_003117313 |
Protein GI | 256395749 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0442384 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.141702 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTTGT CCATACTGGG AAGACGGCTG ACGGCCTGGA CGGGCACCGC GGCGTTGACG GCCGCGGTCG CGGTGTCGGC GGCTGCGCAG ACGTCGGCCG CGACCGCCGT GACGTTCTAC GCCTCGCCGA GCGGCAGTGG TTCTGCCTGC TCGCAGGCCG CGCCGTGCTC GTTGTCGGGC GCGCAGGGCG CCGTTCGATC GCAGTTGGCT GCGACGCCGG GCGCCGACGT GACCGTGCTG GTGCAGGACG GCACGTACCG GCTGGCCGCC ACATGGGCCT TCGGTGCGGC GGATTCGGGA AGCGCCGGGC ATCCGGTGGT GTGGCAGGCC GCGCCGGGGG CGCATCCGGT GATCTCGGGG GCGTCTCAGG TCACTGGCTG GACTCAGGTG GGGACGTCCG GGGTGTGGTC GGCCGTGGTT CCGGCAGGTA GTGCCAGTCG GCAGCTGTAC GTCAGCGGAG CCGAGGCGCC GATCGCGATG TCCACGCCGT CCGCCTTGGG GTTCGCCGGC GGTTGGAGTG GCACGTCGAC CGGCTATTCG ATCGGCGGCG ACGCCGCGGC GTCGGCTTGG TTCGGCGCGC TGAGCGCGGC CCAGGTCGCC GGGGTGGAGT TCGACTACCC GGGCGGCAAC GGAGCGTGGA CCGAGTCCCG GTGCAGGGTG GCGAGCTACT CGGCCACCGC GAAGACGCTG ACGATGGCCC AGCCGTGTTG GACCGACACG ACCGCGCGCG CCTCGTTCAG CCAGGGCAGC GGCGGACTGC CGTCGATGTC GACCGGCACC ATGCCGACGC TGATCGAGGA TGCCAGGGCT CTGATCCACC CCGGGCAGTG GTTCCTCGAC TCGGCCGCGA ACACCCTCTA CTACCAGCCC GCCACCGGAC AGCAGATGAG CGCGCTGGAC GTCGAGCTGC CGCGGTTGGA GTCCCTGCTG CAAGGAGCCG GCACGCTGGC CAAGCCGCTG CACGACGTGA CGTTCCGCGG TCTGCAGTTC TCCTACGCCA CCTGGAACGC GCCCTCGGCT GCCTCCGGCT TCGCCGACGT GCAGAGCAAC CTGCGGATGA CCGGGGCGAA AAACCAGGGC ATGTGCACCT TCTCCTCACC GGCCGGGACT TGCCCTTGGG GCGCGCTGAC CCAGCCGACG GCGAACGTGG CGTTCACCGC CTCGAACAAC GTCACGCTCA CCGGGAACCG GTTCGCCGAG CTCGGCGGCG CCGGGCTGAG CGTGATGTAC GGGTCGGCGA ACACCCTCAT CCAGGGCAAC GAGTTCACGG ACATCGCCTC GACGGCGATT CTGCTCGGCT GTACCTACGA TCCGCTGCCG ACCGACGCGT CGGAGTCCGC GGGCATCAAG CAGAACTGCA CACCGAACGG CTCCGCGGTC AGCGCGGACG TGATCGGGAC CAACGAGATC CTCACCGGCA CCACCGTTTC GGACAACATC GTCCACCACA TCGGCACCGA CTACTCCTCA GCCTGCGGCA TCACGCTGTT GTTCTCGCGC GGCACCACCA TCACCCACAA CGACCTGTAC GACCTGCCCT ATACCGGCAT CACCGCCGGC GTCATCCAAG GACACGTCGA CCAGGCCAGC GCGCCGCAGA ACTCGACCAA CATCAACGAG AACAACACCC TGAGCGACAA CGTCTTCCAC AACTACCTGT CGGTGCGCAG CGACGGCGGC GCGATCTACG CCGAAGGGCA TCAGACGCAG TACGTCTACC AAAGCGGCGG CACGACGATC GACCCGGTCC AGACCCTGGC CCATGGTCTC CAGGTGACCG GCAACATCGC TTATCACGGC 
CCGACGACCA ACTTCACCTA CTACGACGAC GCGGGCTCGG AGTGGATCAA CTGGCAGGGC AACGTCGCCT TCGGCGCGGG CTCGGCGTCG CAGGGCGGCT GTAGCCCGAC CGGCCATTTC TGGATCGTCG GCAACTACTT CTCCAACCAG ACGCAGTACT ACCCGTGCAA CGCGCCGGTC GATTCCAACG TCAGCGGTAC GACCACCATT TCCGCCACGC CGGCGCCGGG CGACGTTCCC AACGGCCTGT TCAGCGCGGC CGGTGTGCGG GCTGCCAACT CGGCACTGGC CGTCGCCGCC GGCCCGAAGA TCTACTACGC CTCGCCGACC ACGAGCACGA GCACGCAGGT GCTCATCGGC GGCGAAGGAT TCAGCTCCAG CACGCCGGTA TTCGTGGGCT CGACGCAGGT CAGCGGCGTC CAGTACCTGT CCGGCGGCTT CCTGATCGTG CCCGTCCCGG CCGGAACCCC GTCCTCCCAG ATCTCCGTCG GCGCGCCCGC CGGCACGAGC CGGCTCAACG ACACCGATCC GTCGATCACC TACAGCGGCT TCAGCTACTC GTCGAACCGC GGTCTCGGCG ACTACGACGA CGATCTGCAC TACGCCACGG CGAACGGCTC CACGGCGAAG TTCTCGTTCT CCGGAACCGG CGTCCAGGTC TTCGGCGAGC AGAACACGGA CCAGGGCAAC ATCGGAATCA GCATCGACGG CGGTACCCAG CAGACCGTCA GCACCGTTCC CGCTGACGGG CAGCGTCACT CCAACGTGGT CGTGTATGCC GCCAGCGGGC TCGCGGCCGG GAGTCACACG ATCGTGGTGA CGAAGCTTTC CGGCCAGTAC GCCACGCTCG ACGGCTTCCA AGCGCTGAAC TCGCGCCTCA ACGACACTGA CCCGTCGATC GCCTACAGCA GCTTCAGCTA TGCGGCGAAC CGTGGCTTCG GCGACTATGA CGACGACGTG CACTACGCCA CGGCGAACGG CTCCACGGCG AAGCTGTCGT TCTCCGGAAC CGGCGTCCAG GTCTTCGGCG AGCAATACAC GGACCAGGGC AACATCGGAA TCAGCATCGA CGGTGGCACT CAACAGACGG TCAGCACAGT GCCGGCCGAC GGCCAGCGCC ATGCGAACGT CGTCGTATAC GCGGCGACCG GACTCGCTCG CGGGAGCCAC ACCGTTGTCG TGACGAAACT GTCCGGGCAG TACACGACCC TCGACGGCTT CGTCATCATT CAGTAG
|
Protein sequence | MGLSILGRRL TAWTGTAALT AAVAVSAAAQ TSAATAVTFY ASPSGSGSAC SQAAPCSLSG AQGAVRSQLA ATPGADVTVL VQDGTYRLAA TWAFGAADSG SAGHPVVWQA APGAHPVISG ASQVTGWTQV GTSGVWSAVV PAGSASRQLY VSGAEAPIAM STPSALGFAG GWSGTSTGYS IGGDAAASAW FGALSAAQVA GVEFDYPGGN GAWTESRCRV ASYSATAKTL TMAQPCWTDT TARASFSQGS GGLPSMSTGT MPTLIEDARA LIHPGQWFLD SAANTLYYQP ATGQQMSALD VELPRLESLL QGAGTLAKPL HDVTFRGLQF SYATWNAPSA ASGFADVQSN LRMTGAKNQG MCTFSSPAGT CPWGALTQPT ANVAFTASNN VTLTGNRFAE LGGAGLSVMY GSANTLIQGN EFTDIASTAI LLGCTYDPLP TDASESAGIK QNCTPNGSAV SADVIGTNEI LTGTTVSDNI VHHIGTDYSS ACGITLLFSR GTTITHNDLY DLPYTGITAG VIQGHVDQAS APQNSTNINE NNTLSDNVFH NYLSVRSDGG AIYAEGHQTQ YVYQSGGTTI DPVQTLAHGL QVTGNIAYHG PTTNFTYYDD AGSEWINWQG NVAFGAGSAS QGGCSPTGHF WIVGNYFSNQ TQYYPCNAPV DSNVSGTTTI SATPAPGDVP NGLFSAAGVR AANSALAVAA GPKIYYASPT TSTSTQVLIG GEGFSSSTPV FVGSTQVSGV QYLSGGFLIV PVPAGTPSSQ ISVGAPAGTS RLNDTDPSIT YSGFSYSSNR GLGDYDDDLH YATANGSTAK FSFSGTGVQV FGEQNTDQGN IGISIDGGTQ QTVSTVPADG QRHSNVVVYA ASGLAAGSHT IVVTKLSGQY ATLDGFQALN SRLNDTDPSI AYSSFSYAAN RGFGDYDDDV HYATANGSTA KLSFSGTGVQ VFGEQYTDQG NIGISIDGGT QQTVSTVPAD GQRHANVVVY AATGLARGSH TVVVTKLSGQ YTTLDGFVII Q
|
| |
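
The length fields under Gene Information can be cross-checked against the listed coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon (the gene sequence ends in TAG, a stop codon in translation table 11). The function names are hypothetical and not part of any database API.

```python
# Consistency check for the record's length fields.
# Assumptions (not stated in the record itself):
#   - Start bp / End bp are 1-based, inclusive coordinates.
#   - Protein length does not count the terminal stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values copied from the Gene Information table for Caci_6626.
start_bp, end_bp = 7635217, 7638252
length_bp = gene_length(start_bp, end_bp)

print(length_bp)                           # 3036
print(expected_protein_length(length_bp))  # 1011
```

Both results agree with the Gene Length (3036 bp) and Protein Length (1011 aa) fields listed in the record.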