Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5033 |
Symbol | |
ID | 8336387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5767639 |
End bp | 5770644 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644958132 |
Product | coagulation factor 5/8 type domain protein |
Protein accession | YP_003115734 |
Protein GI | 256394170 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGATCA GTAGACGTAC AACCGGCGCC CTGCTGGCAG GCGCCCTCGC CCTGGCGGGC CTGTCCACCT CGGCCGCCCT GACCGCGGCG CCCGCCCACG CCGCGTCCGC GCCGACCACC CCGATCTGGT CCACCCAGCT CGACTTCGAC AACGGCGGCG CCGCCTGGTC AGAGCCCTAC TTCGCGGCGC TGGCGGCCAA AGGGCTGACC ACCGCCGAGC TGAACATGCC CTGGGGCACG ATCGAGCCGT CGGCCGGGAC CTTCAGTTTC ACGATCTGGG ACCAGGAGTT GGCGAACGCC GCCGCTGCCG GCATCCAGCT GATCCCGGTC TTCTGGCAGT CCGGGTGGGG CGGCAGCCCC GCACCGTGGA TCACCGACTT GGAGAAGACC AGCACCGGGG CGGCAGGCGT GGCTCCGGAC TGGTGGAACA CCACCGAGCA GGCGCAGTAC TTCACCTATG TCGAGAACAC CATCCAGAAC TCCATCGCAC AGCCCGGCGG CTACGGCGGC GCGGTCCTGG ACTACGGATT CCTCGACGCG CAGTGGGACA TCAGCGGCTC CGGCGGCGGC TATGCCAGCG GCGACATCAC CGAGTTCCAG AACGTGTACC TGCCGAACGC CTTCGGCACC ATCGCCGCCT TCAACGCCGC CGAGGGCACG TCCTACACAG CCTTCAGCCA GGTACCTGCG CAGGCTTCCG GACAGCCGTT GTTCGGGGTG TTCCAAGCCT TCCGCGCCTG GAGCGTCGAG CAGACCTACG GTGCGCTGAC CGCCGCCGTC CGCAAGATCA CCGCGAACAC GCCGCTGTAC TACTACTACG GCGGCAGCTA CGGGAACGTG ACGAACTACG CCAACAACCC CGACAGCTTC TTCAAGCTCG CCAAGCAGTA CAACGTCACC ATCATCGCCG ACTCGGCCAG CAACACCGGC ATGACGCTGG CGATGACGAG CCTCGGGCGC GCCTACGGCG TGAAGGTCGC CGAGGAGTGG ACGGCGCCGA ATTCGGACTC TGAGTTGGCC GCGTACGCCG TGCAGTGGCT CGACAGCTAC GGGATGACGT TCCCGCAAGC CGGCGGCGAG GACTTCTTCA TCCACGACGG CACCTCGAAG GACACCGTCG GCTACCCGAT CTACACCAGC TGGCTGCCGA CCCTGAAGAG CCTGTCGGGC ACCTACCCGC AGCAGCCCAC CGCGCTGTAC ATCGACGTCT CGCAGGGCTA TGGCAACACC AACGGCGGCA GCCTGAACAC CGTGGAGAGC CAGGCGGCGG CCATCTGGAA CAGCTTCCAG TCCGGACTGG CGGTCGTCAC CAGCCAAGAG GTGGCGAACG GCGCGGTGAG CCTGTCCTCG TTCAACGCCG TGCTGCCGCT CAACGGGGTC GATGCGAATC TGACCTCGTA CAAGAACGGC GGCGGCGCCC TGCTGACGTC CGCGGCGCAG CTGACTCAGC ATGCGAGCGC CTACGCGGTG ATCGACGCGC CCTACGTCGG CGACGTGCAA GCCGTGCCGG TCCTGGCGGC CAGTCACACC AGTGCCTCGC TGACCTTGGC GGACATCACC ACCGGAACCG CCTACAACGC GCCGATCGCG ATCAACCCGG CCGGGCTCGG CCTGAACTCG GGCAGCTACT ACGTCGTCAA CGCCGCCGGG ACAGCACTCC CCCAGACCGT CCAGTCGAAC GGACAGATCT GCGTGAGCGC GAACCTCGGC GCGGCGAGCC TGGCCGAGTG GACCGTCAAG GCCGGGCCGG TGCCCGCCGG GACCGCCTCG TCCGGCTGTC CGACCACGTA CACCGGAGCC ACGTCGGTGA GCGCCACCGC CGGCCAGTCC GGCGGCGGGT TGACCTTCCT GGGCGTCGGC GCGACGAACC AGGGCTCTGA CGGAAACCTG ACACAGATCA CCCAAGGCGG CCAGACCGCC TATGAGACCT GGACGTCCGC GCAAAGCGGC GCGACCGGCT CGGCCGACGT CTACCTCCAG GCGGCGCCGA TGTCCGCGGT CGAGGCGGCC GCGACCATCT CGATGCAGGT CACCTATTGG GCGACCGCCG GTCAGGGCTT CACCGTGCAG TACAGCACGC CGACGAACAA GTACCAGAAC GGACCGAGCG TCACCAGCCC GGGCACCGGG ACCTGGACCA CGGCGACCGT CCAGCTCACG AACGCCCAAC TCGGCGAGTT GGAGAACGGA GGCGCCGACC TGCGGCTCGC CGTCGCCGAT GTCACCACGC CGCTGATCGT GCGCAGCATC ACCATGTCGG CCGGGAACAG CAGCGCGCCG GTCCTGGCCG CGACGCCGAG CTCGCTGTCG TTCGGCAGCG TGAGCACCGG TTCGACCAGC GCGGCCCGCA CGGTGACCAT CACCAACTCC GGCAACGCCG CGGCGAGCGT TTCCAGCATC TCGACGACCA GCGGCTTCGC CCAGACCAAC ACCTGCGGAT CCAGCATCGC CGCGGGAGCG AGCTGCACGG CGAGCGTCAC CTTCTCCCCC ACCGCCGCTC AGACCTACAG CGGCAACCTG ACGGTCACCA GCACCGCGAC CGGCAGCCCT CTGATAGTCG CGCTGTCCGG AACGGGCACG AGTTCGAGCA CGAACCTCGC GCTCAACAAG CCGATCAGCG CCTCTACTGT CCAGCAGAAC TACGTCCCCA CCAACGCCGT CGACGGCAAC ACGGGCACGT ACTGGGAGAG CCGGGACGGG ACCTGGCCGA GCAGTCTGAC CGTGGATCTG GGTTCGACAC AGACGCTCAG CCACACGGTC ATCGACCTGC CACCGCTGTC CGTCTGGCAG ACGCGGACCC AGACCCTGTC CGTCCTGGGC TCGACCAACA ACTCCACCTG GACGACCATC GTCGCCTCAG CGGTCTACAC GTGGAATCCG AGCACGGGCA ACACCGTGAC CATCACGTTC CCCGCCGGCA CGGCGTACCG GTACGTGCAG CTGAACTTCA CGGCGAACAA CGTGCAGAAC GGCGCGCAGG TCTCCGAGTG GCAGCTCTTC GGCTGA
|
Protein sequence | MRISRRTTGA LLAGALALAG LSTSAALTAA PAHAASAPTT PIWSTQLDFD NGGAAWSEPY FAALAAKGLT TAELNMPWGT IEPSAGTFSF TIWDQELANA AAAGIQLIPV FWQSGWGGSP APWITDLEKT STGAAGVAPD WWNTTEQAQY FTYVENTIQN SIAQPGGYGG AVLDYGFLDA QWDISGSGGG YASGDITEFQ NVYLPNAFGT IAAFNAAEGT SYTAFSQVPA QASGQPLFGV FQAFRAWSVE QTYGALTAAV RKITANTPLY YYYGGSYGNV TNYANNPDSF FKLAKQYNVT IIADSASNTG MTLAMTSLGR AYGVKVAEEW TAPNSDSELA AYAVQWLDSY GMTFPQAGGE DFFIHDGTSK DTVGYPIYTS WLPTLKSLSG TYPQQPTALY IDVSQGYGNT NGGSLNTVES QAAAIWNSFQ SGLAVVTSQE VANGAVSLSS FNAVLPLNGV DANLTSYKNG GGALLTSAAQ LTQHASAYAV IDAPYVGDVQ AVPVLAASHT SASLTLADIT TGTAYNAPIA INPAGLGLNS GSYYVVNAAG TALPQTVQSN GQICVSANLG AASLAEWTVK AGPVPAGTAS SGCPTTYTGA TSVSATAGQS GGGLTFLGVG ATNQGSDGNL TQITQGGQTA YETWTSAQSG ATGSADVYLQ AAPMSAVEAA ATISMQVTYW ATAGQGFTVQ YSTPTNKYQN GPSVTSPGTG TWTTATVQLT NAQLGELENG GADLRLAVAD VTTPLIVRSI TMSAGNSSAP VLAATPSSLS FGSVSTGSTS AARTVTITNS GNAAASVSSI STTSGFAQTN TCGSSIAAGA SCTASVTFSP TAAQTYSGNL TVTSTATGSP LIVALSGTGT SSSTNLALNK PISASTVQQN YVPTNAVDGN TGTYWESRDG TWPSSLTVDL GSTQTLSHTV IDLPPLSVWQ TRTQTLSVLG STNNSTWTTI VASAVYTWNP STGNTVTITF PAGTAYRYVQ LNFTANNVQN GAQVSEWQLF G
|
| |