Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shel_13970 |
Symbol | |
ID | 8395287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Slackia heliotrinireducens DSM 20476 |
Kingdom | Bacteria |
Replicon accession | NC_013165 |
Strand | - |
Start bp | 1599433 |
End bp | 1601004 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644986151 |
Product | collagen triple helix repeat protein |
Protein accession | YP_003143767 |
Protein GI | 257064095 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTCTACC CGGCAGCGAT GTGCGAGGCT GCTGGTGTCG TGGATGCGAG CATCATGTTG TCGCTGGGCG ATGACCGCTA CTTTTCCACG CGCAACTTCA ACATCCGAGT CGAGAAGGTT TTGATTGATG GTTTGGAACC CGAGGATGGG TTCACGCTGT TTGTGCAGGC CATTGCAGCA TACGAGAATG CAGCGGACAT CAGCACGGAA GCCGCTGAGG CAGCCAATGA GGCGGCAGAG GCGGCAAACC AAGCCGTTAG CGACTTGCAG GATGCCGCCC AGCGCGGGGA TTTCGATGGA GCCGATGGCG CTGACGGGTT CAGCCCGACT GCCACTGTGA CGCAGACCGC CGATGGCGTG ACCATTACCA TCACCGACAA GAACGGCACT ACGACGGCCG ATGTCTCTAA AGGCGCAAAG GGAGACAAGG GCGATACCGG CGAGCAGGGT CCGAAAGGCG ACAAGGGTGA CACCGGAGAT CGTGGACCTC AGGGAATTCA GGGTGAGACC GGACCCAAGG GAGATAAGGG CGATACAGGG GCTACCGGTG CTCAGGGACC TAAAGGCGAA ACCGGAGAAA CGGGCGCAAC TGGTGCTACA GGACCAAAGG GGCCAAAGGG TGATACGGGC GCACAGGGGC CGCAGGGCAT CCAGGGTGAA ACAGGTCCCA AGGGCGAAAC CGGTGCGACT GGAGCAGCCG GCAGCGACGG CGTGAGCTGC ACGCATTCGT GGAATGGAAC CGTGCTTTCC GTCACGAGCG CATCGGGCAC GAGCTCCGCC GACCTTGTTG GGCCGCAGGG GCCAACCGGC GCTACAGGAG CTACGGGTCC CGCTGGCGCT GACGGCACGA CATTCACTCC GCAGAACCCG CTTTCGCTTT CGAACGGCGA ACTATCCGTC GATTTATCTG CAAACACGGA TACCCAGCAT TTAGCGCCGA CGTTCTGGCA GAAGTGGACT TATCCGCGCC ACCCGGACAG CGAGGGCCGC TACTGGGTCT CGCTCGGGCA GGACACGAGT CAGGGATACC GCTCATACGA CATGATCTTC AACGGGCAAT ATCGCGAGCT CGACGTGATA ACGATGGTGT CGAAGAACAG CGGGTACGCG AGCGCGATCC TGCAGCGCGT GGCCGAGTTC GGTTTGATGC GCAAGTTCGA GACGACCACC GACTTCGACG TAGACCCGGG CGATACGGAG GCGGTGACGA TTCCTGCCGG TGACTTCTAC TCGGTGCCCG CCGACTGCGT CTTCCTGAAC ACCACGACAG GCAACTTGAT GATTAACACC GCCGAAGCGG CTCCCGCCAG CAGAATCGCC GCGACCTCTC TTACCGTGAC GGGGCTTTGC AACATATACG ACGTGGCTGC TGCATTCACG GCATCGTCGC CGCTTTCGCT CTCCAACGGC GTGCTCTCGA TTGACCTCTC CGGTTATGCC GCGCTTACGG GTGCCACATT CACTGGGGCC GTTTCGGGTA TCACACCGAC CGCTGATGCG AACTTCGCCA CGAAGAAGTA CGTGGACGAC GCCATTGCCG CACTTGACGA CCTCTCGAAC GAAAGCTTCT GA
|
Protein sequence | MFYPAAMCEA AGVVDASIML SLGDDRYFST RNFNIRVEKV LIDGLEPEDG FTLFVQAIAA YENAADISTE AAEAANEAAE AANQAVSDLQ DAAQRGDFDG ADGADGFSPT ATVTQTADGV TITITDKNGT TTADVSKGAK GDKGDTGEQG PKGDKGDTGD RGPQGIQGET GPKGDKGDTG ATGAQGPKGE TGETGATGAT GPKGPKGDTG AQGPQGIQGE TGPKGETGAT GAAGSDGVSC THSWNGTVLS VTSASGTSSA DLVGPQGPTG ATGATGPAGA DGTTFTPQNP LSLSNGELSV DLSANTDTQH LAPTFWQKWT YPRHPDSEGR YWVSLGQDTS QGYRSYDMIF NGQYRELDVI TMVSKNSGYA SAILQRVAEF GLMRKFETTT DFDVDPGDTE AVTIPAGDFY SVPADCVFLN TTTGNLMINT AEAAPASRIA ATSLTVTGLC NIYDVAAAFT ASSPLSLSNG VLSIDLSGYA ALTGATFTGA VSGITPTADA NFATKKYVDD AIAALDDLSN ESF
|
| |