Gene Information |
Locus tag | Caci_7141 |
Symbol | |
ID | 8338509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 8304216 |
End bp | 8305832 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644960222 |
Product | coagulation factor 5/8 type domain protein |
Protein accession | YP_003117811 |
Protein GI | 256396247 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
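The coordinate and length fields above are consistent with one another, which can be checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes a terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(8304216, 8305832)
print(length)                     # 1617
print(protein_length_aa(length))  # 538
```

Both results match the Gene Length and Protein Length rows of the record.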
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.174227 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.195256 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGGCA TCCCCACCCC CGGCCGATCA CTGCACATAG TCTCGGCGGG CGCGGCGCTC ACGATCGCCG CTGCCGCCGC ACTGGTCACG ATCAGCTCCT CCGCCCAGGC CAGCGCCACC AACCTCGCCT GCGGGCGGCC CGCCACCGCG TCGTCGAACT CTGCGACAGC AGGCAACGCC GACGACTGCG CGTCCGGCAC CGTCTGGCAG AGCGGCACCA GCAAACCGCA GCAGTGGCAG GTCGATCTGG GCTCCGACAC GACCGTCGAC CACGTGAGCG TGACCTGGGG CGCCGGGTAC GGCACCAACT ACAAGATCCG CACCTCCGAG GACGGATCCA GCTGGCACAC GGTCGTCGCC ACGACCACCG GCCACGGCGG CACCGAGACC CTCGCCCTCC CGGCCAACAC CGTCACCCGC TGGATCCAGG TCTACCTGAG CCAGTACTCG GGCACCGCGG GCTTCACCAT CGATGAAGTG GCCGTTTACG GCACCCCCGG CGCGCCCGGC TCATCGTCGA CGCTGCCCAC CACGCCTTCC ACGACACCCT CCACCACCCC GTCGACGACA CCTTCGACCA CGCCGTCGAC AACGCCCTCC ACCACGCCCT CCACCACCCC GTCCACGACG CCGTCGTCGT CCCCGCCCGG CGGCAAGACG TGGAACGTCA GCACCCCCGC CGCGCTGACC GCGGCCCTGG CCGGCGTGGC TCCCGGCGAC ACCATCGTGC TGGCGCCAGG CAGCTACGAC GGGGCGTTCT ACACGTTGAC GTCGGGGACC TCGAGCAAGC CCATCACGCT GACCGGCCCG CGCACCGCCA AGCTCTCCAA CAGCGCGTCA GCCTGCGACC CGAACTCGCC GCCGTCCAAC AGCGACGTCT CCTACTGCGG CTACGGCCTG CACCTGAACC ACGTCACCAA CTGGCACCTG ACCGGATTCA CCGTCACCAA CTCCTCCAAG GGCATCGTCC TCGACGGCTC CAGCAACAAC ACCCTGAACA GCGTCGAGGT CGACCAGATC GGCGACGAAG GCGTCCACTT CCGAGCCGAC AGCTCCAGCA ACCTGATCGA GAACTCCGCC ATCCACGACA CCGGCCGCGT CCAACCCGGC TACGGCGAAG GCCTCTACTT CGGCTCAGCG GAGAGCAACT GGGACAAGTA CGGCGACAGC ACCGGCCAAG ACCGCAGCAA CAACAACCAG GCCATCGGCA ACACCTTCGG CCCCAACATC GCCGCAGAAC ACATCGACAT CAAAGAAGGC ACCACCGGCG GCCTAGTCCA AGCCAACACC TTCACAGGCG GCGTCTCCGG CGAAAACAGC GCCGACTCCT GGGTCGACGT CAAGGGCAGC AACTACACCC TCACCGCCAA CCACGGCACC TACCCCCCCG GCGGCGTCCT AGCCGACGGC TACCAAGTCC ACCGCATAGT AGCCCCCTTC GGCTGCGGCA ACACCTGGAA GAACAACGAC TCCGACCTAG CCAACGTAGG CAACTACGCC ATCAACATCA CCGACCAATC CGACTGCGCC ACCAACCCCA ACATCGTCTA CAGCACCAAC ACAGTGACCC ACGCCGTAAA GGGCCTCACC AACATCCCGG TGACGGCCGG CGGCTGA
|
Protein sequence | MRGIPTPGRS LHIVSAGAAL TIAAAAALVT ISSSAQASAT NLACGRPATA SSNSATAGNA DDCASGTVWQ SGTSKPQQWQ VDLGSDTTVD HVSVTWGAGY GTNYKIRTSE DGSSWHTVVA TTTGHGGTET LALPANTVTR WIQVYLSQYS GTAGFTIDEV AVYGTPGAPG SSSTLPTTPS TTPSTTPSTT PSTTPSTTPS TTPSTTPSTT PSSSPPGGKT WNVSTPAALT AALAGVAPGD TIVLAPGSYD GAFYTLTSGT SSKPITLTGP RTAKLSNSAS ACDPNSPPSN SDVSYCGYGL HLNHVTNWHL TGFTVTNSSK GIVLDGSSNN TLNSVEVDQI GDEGVHFRAD SSSNLIENSA IHDTGRVQPG YGEGLYFGSA ESNWDKYGDS TGQDRSNNNQ AIGNTFGPNI AAEHIDIKEG TTGGLVQANT FTGGVSGENS ADSWVDVKGS NYTLTANHGT YPPGGVLADG YQVHRIVAPF GCGNTWKNND SDLANVGNYA INITDQSDCA TNPNIVYSTN TVTHAVKGLT NIPVTAGG
|
| |
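The sequences above are printed in space-separated blocks of 10 characters, so the spaces need to be stripped before any analysis. The following is a minimal sketch of doing that, then computing GC content and translating the DNA. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA even though the Strand field is "-"), and it uses the codon-to-amino-acid assignments shared by NCBI translation tables 1 and 11. All function names are hypothetical, and only the first three blocks of the gene sequence are used as an excerpt.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); identical for tables 1 and 11.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean_sequence(raw: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Excerpt: first three 10-base blocks of the gene sequence in this record.
excerpt = clean_sequence("ATGAGAGGCA TCCCCACCCC CGGCCGATCA")
print(len(excerpt))        # 30
print(translate(excerpt))  # MRGIPTPGRS
print(round(gc_fraction(excerpt), 3))
```

The translated excerpt matches the first block of the protein sequence listed above. Running the same functions on the full gene sequence would be the way to compare against the GC content and Protein Length rows.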