Gene Information |
Locus tag | Caci_2566 |
Symbol | |
ID | 8333915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 2905713 |
End bp | 2907263 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 644955719 |
Product | extracellular repeat protein, HAF family |
Protein accession | YP_003113325 |
Protein GI | 256391761 |
COG category | [S] Function unknown |
COG ID | [COG5563] Predicted integral membrane proteins containing uncharacterized repeats |
TIGRFAM ID | [TIGR02913] probable extracellular repeat, HAF family |
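The coordinate and length fields above are related by simple arithmetic, and a small check makes that relationship explicit. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 2905713
END_BP = 2907263
REPORTED_GENE_LENGTH = 1551      # bp
REPORTED_PROTEIN_LENGTH = 516    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print("gene length (bp):", n, "reported:", REPORTED_GENE_LENGTH)
    print("protein length (aa):", protein_length(n),
          "reported:", REPORTED_PROTEIN_LENGTH)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length fields.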
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCATTCGT CCGGCAGGCG TTCGGCAGGC GCCGTCGCCC GTGTCGCAGT ACTCTGCGCC GCTCTGGTCG TGCCCGTCCC GTTCGCGGTA TCGGCATCGG CTGGTGTGGC CGGCGCGGCC AGTGCGGCCA GTGCGGCCAC CGCGCCGCTC TCCACCATCA CCGACCTCGG CACGCTCGGC GGTGATCTCA GCATCGCCAA CGGGATCAAC AACGCCGGGG TCGTCGTCGG CTACAGCGAT CTGGCTTCCG GCACTCAGCA CGGCTTCCGC TGGTCCGGAG GGACCATGTC CGACCTGGGG GTCGAGGCCG GGGGCGGGGA CAGTGTGGCG AACGCCGTCA ACGACGCCGG TCAGATCGCG GGCCAAGCGA CGCGCGCCGA CGGCGGTTAC GCGTATCCGG TCCGCTGGAG CGCCGCCGGC GTGCTGCAGG ACCTCGGCGG CCCGATCACC AACCGGCTGG GCGTCGGCAA CGCCATCGAC CCCTCCGGCC GCGTCGCCGG CGGTCAGCGT CCGGCCGACT CCGAGGGCAG CCCGGAGGCG ATCGTCTATG ACGCCGCCGG CAACCCCACC GAGCTGAGTA CGCCGACGCA GACCCTCAAC GCGGCCACCG GCATCAACGC GCGCGGGCAG GTCGTCGGCG GTCCGGCGTT CGTCTGGCAG AACGGGTCCC TGACCATGCT GCCGGTGCTG CCCGGCGGTC AGGGCGGATC GGCCAACGCC ATCAACGTCT CCGGCACGAT CGTCGGCACC GTCAGCCGAA CCGGCACGCT CAGCGGTCTG GACGCCGCGC TCTGGCAGAA CAACACCCTG ACAGACCTCG GCACGATCGA CGCGATCCAG TACAACCAGG CGACCGCGGT CAACGCCGCG GGCCAGATCG TCGGTACCGC CGACCCCGAG TGTCAGCCGT GCGCCGCACC GGAGGCGTGG CTGCGCCAGC CGGGCGGCGC GCTGACGAAG CTGGACACGC TGCTCCCCGC CGGCTCCGGC TGGACCCTCC AATCAGCCAC CGGGATCAAC GACCGCGGCC AGATCGTCGG CGTCGGCCTC CACAACGGCC ACAAGCGCGG CTACCTGCTC ACCCCGGCGT TCGCCGCGAC CGTGAACTTC GAACCGGCCG GCTCGACGAT CCCGGTGGGC TACGCGGCGG ACACCGGCGC CGCGTACGGT GCGCGGTCCG GCGGCCTGAC CTACGGCTGG AACATCGACA ACTCCGTGAA CACGAGGGAC CGCAACGCCT CGAGCTCCCC GGATCAGCGC TATGACACGC TGATCCACAT GGAGCGCAGC GGAAGCGCGA CGGTGTGGGA GATGGCGGTG CCGAACGGCC ACTACACGGT GCACCTGGTC TGCGGCGATC CGTCGAACAC CGACAGCGTC TACAAGGTCA ACGTGGAGGG CGTGCTCACA GTCTCAGGGA CGCCGAGCGC CGCCAGCCAC TGGATCGAGG GGACCAGCCA GGTCACGGTC TCCGATGGCA AACTGACCAT CACCAACGCC ACCGGATCGA GCAACGACAA GCTCGCTTAC GTGGACGTCA TCGCTTCCTG A
Protein sequence | MHSSGRRSAG AVARVAVLCA ALVVPVPFAV SASAGVAGAA SAASAATAPL STITDLGTLG GDLSIANGIN NAGVVVGYSD LASGTQHGFR WSGGTMSDLG VEAGGGDSVA NAVNDAGQIA GQATRADGGY AYPVRWSAAG VLQDLGGPIT NRLGVGNAID PSGRVAGGQR PADSEGSPEA IVYDAAGNPT ELSTPTQTLN AATGINARGQ VVGGPAFVWQ NGSLTMLPVL PGGQGGSANA INVSGTIVGT VSRTGTLSGL DAALWQNNTL TDLGTIDAIQ YNQATAVNAA GQIVGTADPE CQPCAAPEAW LRQPGGALTK LDTLLPAGSG WTLQSATGIN DRGQIVGVGL HNGHKRGYLL TPAFAATVNF EPAGSTIPVG YAADTGAAYG ARSGGLTYGW NIDNSVNTRD RNASSSPDQR YDTLIHMERS GSATVWEMAV PNGHYTVHLV CGDPSNTDSV YKVNVEGVLT VSGTPSAASH WIEGTSQVTV SDGKLTITNA TGSSNDKLAY VDVIAS
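The GC content and protein sequence fields can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own pipeline: it assumes the sequence is pasted in the space-separated blocks of ten shown above, and it builds the codon table from the standard amino-acid string, which translation table 11 shares with table 1 for elongation codons (the two tables differ only in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first 30 bases of the record are used here to keep the example short.

```python
from itertools import product

# Codons in T, C, A, G order, paired with the standard amino-acid string
# (assumed valid for table 11 elongation codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in the record above.
PREFIX = "ATGCATTCGT CCGGCAGGCG TTCGGCAGGC"

if __name__ == "__main__":
    print(translate(PREFIX))             # first residues of the protein
    print(round(gc_fraction(PREFIX), 3))  # GC fraction of the prefix only
```

Translating this prefix gives the first ten residues listed in the Protein sequence field. Passing the full Gene sequence field to the same functions would allow the reported GC content and protein sequence to be checked in the same way.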