Gene Information |
Locus tag | Caci_3717 |
Symbol | |
ID | 8335070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4182383 |
End bp | 4183693 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644956857 |
Product | Carbohydrate-binding family V/XII |
Protein accession | YP_003114460 |
Protein GI | 256392896 |
COG category | |
COG ID | |
TIGRFAM ID | |
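The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 4182383
end_bp = 4183693

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# One codon is 3 bp. Assumption: the final codon is a stop codon and
# does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1311 0 436
```

Under these assumptions the computed values agree with the Gene Length (1311 bp) and Protein Length (436 aa) fields.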
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.609198 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0379459 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGGATCG GAAAACGGAA CCTTCGCGCA GCACTGACAG CCGGCTCGGC GTTCGCGCTG GCAGCCGGCG CGGCCGTCCT CGCGGTGGCT GCGGCTCCCA GTGCACAGGC CGCCACGTGC GCCGCGCCGT GGAGTGCCGC CACGGTCTAC ACCGGCGGCC AGCAGGCCAG CGAGAACGGG ACCAACTACA CCGCCAACTG GTGGACGCAG GGCAACGACC CGGCGACCAA CAACGGCGGA TCCGGCACCG GACAACCGTG GACGTCCAAC GGCGCGTGCA CCGGCGGTAC CGGCGGCGGC ACAGGCGGCA CAGGCGGTGG CACCGGCGGC GGAACCGGCG GCACCGGTGG CGGAACGGGC GGGGTGAGCG GCCTGCTGCT CAGCCCGTAC AAGGACGTCA CCGTCAACAT GAACTGGAAC ACCTACCAGA TGCAGTCGGC GGTGACCGGG TCCGTCATAC CGGTAGTCGG TTCCGGAAGC CTGGTGTCGC AGTACGTTCC GAAGCTGCCC GCGATCACCC TGGCGTTCGC CACCGGCTCC TGCGGCAGCG AGACCTGGGG CGGCGTCCCG GCTGCCAACT TCGCCTCGGA GAACGTGGCC CAACTGCACG CCGCCAACCT GAACTACGTC GTGTCGACCG GCGGCGCCGC CGGCAGTTTC ACCTGCGCCT CGGCGTCCGG CATGAAGTCC TTCATCGCCC GCTACGCCAG CTCGAACCTG GTCGGCATCG ACTTCGACAT CGAAGGCGGC CAGAGCGCGT CGGACATCCA GAACCTCGTC GCCTCCGCGG TCGGAGCCCA GTCGCAGTAC CCGAACCTGC AGTTCTCCTT CACCCTGGCC ACCCTCGGGG CCTCCGACGG CAGCTACGGC GGAGTGAACT CCCTCGGCAA CACGGTCGTG CAGGCCGTCC GCGGCTCCAG CCTGAACCAC TACGTCATCA ACCTGATGAC CATGGACTAC GGCAGCGCCT CCAGCAGCGT GTGCGTCGTC TCCGGCGGCA CCTGCCAGAT GGCCCAGTCG GCGATCCAGG CGGTGAAGAA CCTCGAGCAC ACCTACGCAA TCCCGGCCAG CAAGATCGCC GTCACCCCGA TGATCGGCAT GAACGACGCC ACCAGCGAGA TCTTCACCGT CGCCGACGTC AACACCCTGT CGTCCTACGC CGTCAGCAAC GGCCTGGCCG GCCTTCACTA CTGGTCACTG GACCGAGACA CCCCCTGCTC AAGCTCCTAC GCGTCCCCCA CCTGCAACTC CGTCCCCAGC ACGACCCCGC TGCAATACAC CAAGCAGTTC ATGAGCGACA CCGGGCACTG A
Protein sequence | MRIGKRNLRA ALTAGSAFAL AAGAAVLAVA AAPSAQAATC AAPWSAATVY TGGQQASENG TNYTANWWTQ GNDPATNNGG SGTGQPWTSN GACTGGTGGG TGGTGGGTGG GTGGTGGGTG GVSGLLLSPY KDVTVNMNWN TYQMQSAVTG SVIPVVGSGS LVSQYVPKLP AITLAFATGS CGSETWGGVP AANFASENVA QLHAANLNYV VSTGGAAGSF TCASASGMKS FIARYASSNL VGIDFDIEGG QSASDIQNLV ASAVGAQSQY PNLQFSFTLA TLGASDGSYG GVNSLGNTVV QAVRGSSLNH YVINLMTMDY GSASSSVCVV SGGTCQMAQS AIQAVKNLEH TYAIPASKIA VTPMIGMNDA TSEIFTVADV NTLSSYAVSN GLAGLHYWSL DRDTPCSSSY ASPTCNSVPS TTPLQYTKQF MSDTGH
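The Gene sequence and Protein sequence fields above are printed in blocks of 10 letters separated by spaces, so the spaces have to be removed before the sequences are used for anything else. The sketch below shows one way to do that and to compute a GC fraction that can be compared with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short input is only the first three blocks of the gene sequence, used for illustration.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits a sequence into 10-letter blocks."""
    return "".join(grouped.split())


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases in `dna` that are G or C."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# Illustrative input: the first three blocks of the gene sequence above.
example = "ATGCGGATCG GAAAACGGAA CCTTCGCGCA"
dna = clean_sequence(example)

print(len(dna), round(gc_fraction(dna), 3))
```

Passing the full Gene sequence field through `clean_sequence` should give a string whose length matches the Gene Length field, which is a quick way to confirm that nothing was lost when copying the record.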