Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_6157 |
Symbol | |
ID | 8337520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 7085052 |
End bp | 7087808 |
Gene Length | 2757 bp |
Protein Length | 918 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644959258 |
Product | Fibronectin type III domain protein |
Protein accession | YP_003116852 |
Protein GI | 256395288 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.622603 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.669224 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAGAC GGAAACGTTG GATCGCGGGC GCCGCGTTCC TCGCCCTCAT CGCGGGCGGG AGCATCGGAG CCGCACAGGG GCTGCAGAGT ACGAGGCCGT CATCCGGCCC CGGCTCGTGC ACCCTCATCG GCTGGAACCC GAGCACCGAC CCGACGAACG CCAAGAACCT GCCGGTCGGC CAGCGGCCCC AGTCCTACCG GCCCGACAAC TTCGACTGCT CCAGCGCCAA GTTCGCCGCG CTCGGCTCGG AGTTCACGAA GTTCCCGCAG CCGCACGACT TCTCGGTGAA CAACCGCATC ACCTACGAGC CCGCCGCCGC CGGCAAGCCG GCCACGGCGA TCCAGCAGCC GGTGGCGGCG GTGAACCCGC TGACGCCGTA CTTCCCGCCG TTCCAGCACT TCGTCATCGT CTACCGCGAG AACCACACGT TCGACGACTA CCTCGGCGAC TGCGCCACCA CCATCTCGGC CTCCTGCAAC GGCAAGGTGC AGAGCACCAA CCACATCAGC TCGGTGCCGG ACCTGCACAG CCTGGCCAAG ACCTACGCGC TGTCGGACTC CTACAGCACG GGCACGCAGC CGCCCTCCGG TCCCAACCAC TGGTGGCTGT TCTCGGCCCA GTCGCAGTCC AGCTCCCAGC AGCAGACCTA CCCCTCCACC GGGACGGAGT TCGACCGCTT CCTGAGCAAC ACCAACGGTC CCACCAACGA GGGCACCAAC GCCTGCACGG CGCCGACCGG CAACGGCAGC GGCTCCAGCC CGTACACGTT CGTCATGAAC GGCGACTTCT ACTGGATGCT CAGCAGCGGC TCGGGCTACT GGAAGAACCC CGCCGACGGC AAGACCGAGG TCCTGCCCCC CAACCGCCCG GGCACGAGCA TCCCGGAGGA GCTGCACTAC AACGAGTACA CCTGCTCCAA CCAGAGCCTT CCGGACAGCA CCATCGCCAA CGACTACATC AACTTCGTCA ACACCAACGG CCTGCCGGTC TACAGCTACG TCGAGCTGTT CAACGACCAC CCGGGCACCT ACCAGGACAT CCCGGGCAAC GACACCGCGA CCAACAACGT GGTCAACGCG ATCGAGAACA ACGCGACGTA CAAGAACAAC ACCCTGATCA TCGTCACCGA GGACGACACC CAGAACGGCA ACAACGGCAC CGACCACGTC AGCAACACCT ACCGGGTCCC GCTGGTCGTC ATCGGCCCGT CGCAGTACGT CAAGCAGGGG TATGTCAGCC ACGTCGCGTA CACGACCAAC AACGTGATCG CGGCGATGGA ACGGACGATG CAGAACGTCA AGGCCGGCAT CATCGACCCC AACGACAACA TCGGGCTGAA CACCTTCCCG ATGACCACCA ACGACCAGGC CGCGCTCGGC GACCCGCTGG AGGACTTCTG GGTCCAGGGC TCGACCCCGC TGTCCGCCTC GGCCACGGCC TCCCCGACCA CCGGCAACGC ACCGCTCGCG GTGAACTTCA CCGGCTCGGC GACCGGCGGC ACGGCGCCGT ACAAGTACAG CTGGAACTTC GGTGACGGCG CGACGAGCAC CACGCAGAGC CCCAGCCACA CCTACAGCTC CGCGGGCAGC TACACCGCGA CGCTGACGGT CACCGACACC TCCTCCCCGG TCAAGACGGC GACCTCCCAG GTCGCGGTGA ACGTCAGCTC GGTCGGCAGC CCGCTGGCGG CCTCGGCGGC GGGCACTCCG ACCTCCGGAC AGATCCCCTT GGCCGTGAAC TTCACGGGAA CGGCGACAGG AGGCACTCCC GCTTACCACT ACAGCTGGAA CTTCGGTGAC GGCTCCGCGA CGAGCACGGC GCAGAACCCG AGCCACAGCT ACACCGCCGC CGGGACGTAC ACGGCGACCT TGACGGTGAC CGACAGCGCC TCCCCGGTGA ACACGGCCAC GTCGACGGTC AGCGTCACGG CGTCGCCGAT CATGGGCACG CCGCCCAGCG CGCCACAGAA CCTCACCGCG GCAGCGGGTA CCAACCAGGT GACGCTGAAC TGGCAGGCGC CGGCGAGCAG CGGCGGTGAG AACATCACCA AGTACAGCGT CTACCGCGGC ACGTCCAGCG GCACGGAATC GCTGGTGACC TCCGGCGGCT GCAGCGGCGT GAGCGGCAGC ACTTTGACGT GCACGGATAC TGGGCTAACC AGTGGTGTTG ACTACTACTA CCGCGTCACC GCGTCGAACC CCATCGGCGA GGGCGGGCAG AGCAACGAGG TCACCGCCAC GCCGACCGGC AGCACCGGAT GCACCGCCGG GCAGCTGCTG GGCAACCCGG GCTTCGAGAA CGGCGCGTCC AACCCGGCGC CGTGGGCGAT CACCTCGACG CACACCCCGC TCTCGGTGAT CAACAGCAGC AGCTCCGAGC CGCCGCACGG CGGGACCTAC GACGCGTGGA TCGACGGCTG GGGCAAGGCG ACCACGGACA CGTTGGCCCA GACGGTGACG CTGCCGTCGG GGTGCACGAC CGAGAAACTC AACTTCTATA TGCACATCGA CACCGCGGAG ACCACGACCA CCACCAAGTA CGACACGCTC AAGGTGCAGG TCCTCAACCC GGCGGGCACC GTGCTGGGAA CCCTGTACAC GTACTCGAAT CTGAACGCCA ACACCGGATA TTCGCTGCAC TCGCTCAGCC TGGCGTCCTA CGCCGGGCAG ACGGTGACGC TGAAGTTCAC CGGCGTTGAG GACGCCGAGT ACCAGACCTC GTTCGTCGTC GACGACGCCA CGGTCAATGT GAGCTGA
|
Protein sequence | MIRRKRWIAG AAFLALIAGG SIGAAQGLQS TRPSSGPGSC TLIGWNPSTD PTNAKNLPVG QRPQSYRPDN FDCSSAKFAA LGSEFTKFPQ PHDFSVNNRI TYEPAAAGKP ATAIQQPVAA VNPLTPYFPP FQHFVIVYRE NHTFDDYLGD CATTISASCN GKVQSTNHIS SVPDLHSLAK TYALSDSYST GTQPPSGPNH WWLFSAQSQS SSQQQTYPST GTEFDRFLSN TNGPTNEGTN ACTAPTGNGS GSSPYTFVMN GDFYWMLSSG SGYWKNPADG KTEVLPPNRP GTSIPEELHY NEYTCSNQSL PDSTIANDYI NFVNTNGLPV YSYVELFNDH PGTYQDIPGN DTATNNVVNA IENNATYKNN TLIIVTEDDT QNGNNGTDHV SNTYRVPLVV IGPSQYVKQG YVSHVAYTTN NVIAAMERTM QNVKAGIIDP NDNIGLNTFP MTTNDQAALG DPLEDFWVQG STPLSASATA SPTTGNAPLA VNFTGSATGG TAPYKYSWNF GDGATSTTQS PSHTYSSAGS YTATLTVTDT SSPVKTATSQ VAVNVSSVGS PLAASAAGTP TSGQIPLAVN FTGTATGGTP AYHYSWNFGD GSATSTAQNP SHSYTAAGTY TATLTVTDSA SPVNTATSTV SVTASPIMGT PPSAPQNLTA AAGTNQVTLN WQAPASSGGE NITKYSVYRG TSSGTESLVT SGGCSGVSGS TLTCTDTGLT SGVDYYYRVT ASNPIGEGGQ SNEVTATPTG STGCTAGQLL GNPGFENGAS NPAPWAITST HTPLSVINSS SSEPPHGGTY DAWIDGWGKA TTDTLAQTVT LPSGCTTEKL NFYMHIDTAE TTTTTKYDTL KVQVLNPAGT VLGTLYTYSN LNANTGYSLH SLSLASYAGQ TVTLKFTGVE DAEYQTSFVV DDATVNVS
|
| |