Gene Information |
Locus tag | Caci_4427 |
Symbol | |
ID | 8335781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5029492 |
End bp | 5033016 |
Gene Length | 3525 bp |
Protein Length | 1174 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644957530 |
Product | Ig domain protein group 2 domain protein |
Protein accession | YP_003115132 |
Protein GI | 256393568 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
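The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal sketch, not part of the record: the function name `cds_lengths` and its `has_stop_codon` parameter are hypothetical, and it assumes 1-based inclusive coordinates, an unspliced CDS, and a gene length that includes the stop codon.

```python
def cds_lengths(start_bp, end_bp, has_stop_codon=True):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates on the replicon (an assumption
    about how the record reports Start bp and End bp).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    # The stop codon is counted in the gene length but encodes no residue.
    protein_len = codons - 1 if has_stop_codon else codons
    return gene_len, protein_len


# Values taken from the record: Start bp 5029492, End bp 5033016.
print(cds_lengths(5029492, 5033016))  # (3525, 1174)
```

With the record's coordinates this reproduces the listed Gene Length (3525 bp) and Protein Length (1174 aa).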
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.407706 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.105272 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGTCGA GGCGAGTTGG TGTGCGGCGG TTGGGATGGG GGCGGGGTGC TGTCGCGCTG GTTGTCGCTG TGGCGGGTTG TGTCGCGGTG GTGGCGCCCG CGCATTCGGT GCCTGGTGCC GTCGGGGCTG GTGGAGGTGG TGGTGCGGCT TCGGGTGCTT GGCTGCCGAC GACGCCGGAT CAGTGGCCGC TGGTTGTCGA CGAGGCCGTG TCGCCGAGTG TGACGCTTAC TCATGGTGTG CAGCAGCATT CTGATGTGCT GCGGACGGTC GGGGGTGCGC AGCGGGCGCA GGTGATGGAT GTCGACCTCG CTGATCCCAA TGTGCGGTTG GGTGTTGTGG AGTCTCACGA TCATCTCACC GATGCTGCTG ATGAGGTCCC GTCTTCCATG GCGCACCGGA CTGGGGCGGT GGGCGGGGTG AACGGGGACT TCTTCGAGAT CTATGGCAGC GGGCGGCCGC TGGGCATGGT CGTCATCGAC GGTCGGCTGG TGAAGAGTCC TGATCCGACG TGGAACGCCG ACCTGTGGGT GCGCCATGAC GGCAGCATCG GCATCGGCAC CGAGACCTAC GCCGGCAGCC TCACCGATGG CGCCGCCACC GCTGCGATTA CCGCCGTCAA CGCTGTGAAC AGCCTGTCCG GCAACGCCAT CGTGCGCGTC ACCCCCGATC TGGGCACCCC GTCGCCGATC GCGGCCTCGA CCGTCGTTGC CGGCCACCTC GGCGCGGACG GCACCACGCT GCTCGTCGAC TCCGTCACCG CCGGCGTGAC CACCCTGCCG CAGCTCGCCG CAGGGACCGA GGACCTCGTC GGCTCCGGGA CCCAAGCCGG GTGGCTGTCG CAGAACATTC ATATGGGCGA CCAGATCGCT GTCAGCGAGA AGATCGGTCC CGACCCCGAC GTCGTCCAGG GCCTGTCCGG CGGCGCGATC CTGGTCCAGA ACGGCCAGCG CGCCGTGCCG CTCCAGGGCT CCGGCGAGAA CAACGTCGAC AACCCGGTCA CCGCGGTCGG CGTCAGCCAG GACGGCAAGC ACGCCGTCTT CGCCGCCTTC GACGGCCATC AGAGCGAAGA CGTCGCCCAA GGCCTGACCC GCCCCCAGAT CGCCGGCTGG ATGACGCAGC ACGGCGCGTA CAACGCGATC CTGTTCGACA GCGGCGGCTC CACTCAGATG GTCGGCCGCC TTCCCGGACA GACTCAGGCT TCTGTCTTGA ACGTCCCCTC CGACGGCCAC GAGCGCCCCG TCGCCAACGG CCTGTTCATC TACAGCACCG AGAAGGCGCC CGGCACGGCG GTCAAGGCGG TCGTCAACAA CGGCAAGCCG CTGACGGCGC TGGCCGGAAC CACCGTCCCG CTCTCCGCCT ACGGCCTGGA CGCCGCCGAC AATCCGGCAC GCGACGCCGC CAGCCTCAAG GTCTCGCCGG GCAAGATCGC GACCGTGAAT GGCAGCACCC TGACGTTCAC CCGCCCCGGT CACGGCGTCC TGCACGTGAC CGCCGGGCAC GCGCAATCGA CCATCCCGCT GACCGTCGTC GACAGTCTCA AAACCCTGAC TGTCACCCCG AACCAGGTCG ATCTCGACAG CGGCGCGACA CAGCTGATCA CGGCCGCCGC CACGGCGCCC GACGGCAGCC CTGTCGACCT CCTCGCCGAA GCCGTGAAGT GGAGTGTCGA CCCACCCGCG CTCGGCACGA TCGCCCCCGA CGGCACCTTC ACCGCGGCGC CCACCGGCTC CGGCATGGTC ACCGTCACCG CGAGCGCCGG CGGACGCACC GCCACCGTCA GCATCGCCGT CGGCCGCTCG 
ACCACGAACG TGGACCCCCT CACCGACGCC TCGAAGTGGT CGATCACCGA CGCCTACATG AACGTCTACC CGCGCAAGGT CCCCAGCCCT GGCACGCACA GCACCTCCGA CGGCTCGATG TCCTTCGACC CGGCGACCAA GGCCCAGCTC GGTGACGCAG GGTCCTTCGA CATCCACTAC GACTACCCCT ACGTCAGCAA AACCCTCGAC CTGAGCGTGT ATCTCAACGA CCCGAACAGC GAACAGGTCC CGCTCCTGAA CGGCACCCAG GCGCCGATCG GCCTCGGCGT GTGGGTCAAG GGCAACCCCG ACCTCGCCTC GCGCCCGGGA GCCGGACTCG CGCCGGGCAT CGTCACCCTC AACGTGGGCA TCTGGCAGTC CACCAGCCAG CCGACCAGCT TCTACCCGAC CGGCGTCACC TTCGACGGCT GGCAGTACGT GGTCGCCCAA CTCCCGCCGG GCCTGCAGTA CCCGCTGCGC ATCAACTACC TCGGCCTGGT CGTCATCAAG CCCGGCCCGA ACCTGAGCGG CGACGTCCAC CTCGCCGACC TGCAAGCGGT CTACTCCCCG CGCCCGCCGG TCCCGCCGAC GTACACGGCG ATCCCGAAGA ACCCGTCGTG GCTGAGCTTC ACCGACGTCG GCTCGTTCAA GCCCGGCGGC ACGAGCATCG CAGCCTTCGA CGACGCGCAC ACCACGGCGG CGAACCCGGC CGCCACCGGA ACCGTGGCGA TGAAGGCGAT GCCCGGACTG CTGGCGAACC TGCCGACCGC CGCGAAACCC TCGATGGTTC AGGCACTCGG CGACATGTCC GACGACGGAC AGCTGCCGGA CCTGACCAAT CTCAAAACCC TGCTAGACGG ACTTGGCGTG CCATACCATG ACGCGGTCGG CAACCACGAG ATCACCCAGG GCGCGACGCC GGAAAACGCC AACTTCGCCA ACGTCTTCGG CGCCACGCAC TACGCCTACG ACCAGGGCGC GGCGCGCGTC ATCGTCACCG ACAGCGGCCA CATCGGCATC ACCGCCTCCG ACCCGTATCA GACGGTTGAT ACCGACGAGT CGCAATACCT GTGGCTGGCG CAGCAACTGA CGCAGAACAC GCAGAAGGTC GCGATCGTCA CCACGCACGT CCCGGCGTAC GACCCGCACC CCCGCGCCGA CAGCCAGTTC TCCGACCGCT TCGAAGCCGA GATGTACGAG CGGCTCGTGC AGCGGTATCA GCAGACGCAC CCCGGCGTCC ACGTCATGAT GCTGTTCGGC CACGCCCGAG GCTGGGCTGA GGACGTGCGG CTGCCCGACG GCACCGAATC GCCCGACGGC ATCCCGAACT TCGTCGTCGC CGACCTCGGC GCCCCGCCGT ACGCGCCGGA AGACCAGGGC GGCTTCTACA ACTACGGGCT CTTCAACGTC CTGCCGAACG GCACGGTGCA GTTCGCGGTG CAACCGCTGC TGGCCTCGAT CGCGGTCACG GCGCCCGCAC CACAGCTGAC ACTCGGCGCC AAGGAGCAGC TGAGCGCGGT GGGCACGTCG CTGACCGGGA CGGACGCCCC GGCGTTGCAG GTACCGATCG CCGATCCGGT GTCGCGGCAC TGGACGTCGT CGGACCCGCG CATCGCCTCC GTCGACCCGT CCAGCGGCGT GCTGACGGCC CACTGCGCGG GGACCGTGAC GGTCGCCGTG ACCGCCGGCG GCATCACAAG CACCGCTTCC GTCACCGCGA AATAG
|
Protein sequence | MVSRRVGVRR LGWGRGAVAL VVAVAGCVAV VAPAHSVPGA VGAGGGGGAA SGAWLPTTPD QWPLVVDEAV SPSVTLTHGV QQHSDVLRTV GGAQRAQVMD VDLADPNVRL GVVESHDHLT DAADEVPSSM AHRTGAVGGV NGDFFEIYGS GRPLGMVVID GRLVKSPDPT WNADLWVRHD GSIGIGTETY AGSLTDGAAT AAITAVNAVN SLSGNAIVRV TPDLGTPSPI AASTVVAGHL GADGTTLLVD SVTAGVTTLP QLAAGTEDLV GSGTQAGWLS QNIHMGDQIA VSEKIGPDPD VVQGLSGGAI LVQNGQRAVP LQGSGENNVD NPVTAVGVSQ DGKHAVFAAF DGHQSEDVAQ GLTRPQIAGW MTQHGAYNAI LFDSGGSTQM VGRLPGQTQA SVLNVPSDGH ERPVANGLFI YSTEKAPGTA VKAVVNNGKP LTALAGTTVP LSAYGLDAAD NPARDAASLK VSPGKIATVN GSTLTFTRPG HGVLHVTAGH AQSTIPLTVV DSLKTLTVTP NQVDLDSGAT QLITAAATAP DGSPVDLLAE AVKWSVDPPA LGTIAPDGTF TAAPTGSGMV TVTASAGGRT ATVSIAVGRS TTNVDPLTDA SKWSITDAYM NVYPRKVPSP GTHSTSDGSM SFDPATKAQL GDAGSFDIHY DYPYVSKTLD LSVYLNDPNS EQVPLLNGTQ APIGLGVWVK GNPDLASRPG AGLAPGIVTL NVGIWQSTSQ PTSFYPTGVT FDGWQYVVAQ LPPGLQYPLR INYLGLVVIK PGPNLSGDVH LADLQAVYSP RPPVPPTYTA IPKNPSWLSF TDVGSFKPGG TSIAAFDDAH TTAANPAATG TVAMKAMPGL LANLPTAAKP SMVQALGDMS DDGQLPDLTN LKTLLDGLGV PYHDAVGNHE ITQGATPENA NFANVFGATH YAYDQGAARV IVTDSGHIGI TASDPYQTVD TDESQYLWLA QQLTQNTQKV AIVTTHVPAY DPHPRADSQF SDRFEAEMYE RLVQRYQQTH PGVHVMMLFG HARGWAEDVR LPDGTESPDG IPNFVVADLG APPYAPEDQG GFYNYGLFNV LPNGTVQFAV QPLLASIAVT APAPQLTLGA KEQLSAVGTS LTGTDAPALQ VPIADPVSRH WTSSDPRIAS VDPSSGVLTA HCAGTVTVAV TAGGITSTAS VTAK
|
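The gene and protein sequences above can be cross-checked by translating the DNA and by recomputing the GC fraction. The following is a minimal sketch under stated assumptions: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, the sequence is pasted as a string with the 10-base spacing left in, and the amino acid assignments are those of the standard genetic code, which translation table 11 shares for internal codons. Table 11 also allows alternative start codons that are read as methionine; that case is not handled here, and this gene starts with ATG in any case.

```python
# Codon table built from the conventional TCAG ordering of the standard
# genetic code (amino acid assignments shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last bases of the gene sequence, copied from the record.
print(translate("ATGGTGTCGA GGCGAGTTGG"))  # MVSRRV
print(translate("GTCACCGCGA AATAG"))       # VTAK
```

The two fragments translate to the first six and last four residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.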