Gene Shel_13970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShel_13970 
Symbol 
ID8395287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSlackia heliotrinireducens DSM 20476 
KingdomBacteria 
Replicon accessionNC_013165 
Strand
Start bp1599433 
End bp1601004 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content61% 
IMG OID644986151 
Productcollagen triple helix repeat protein 
Protein accessionYP_003143767 
Protein GI257064095 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCTACC CGGCAGCGAT GTGCGAGGCT GCTGGTGTCG TGGATGCGAG CATCATGTTG 
TCGCTGGGCG ATGACCGCTA CTTTTCCACG CGCAACTTCA ACATCCGAGT CGAGAAGGTT
TTGATTGATG GTTTGGAACC CGAGGATGGG TTCACGCTGT TTGTGCAGGC CATTGCAGCA
TACGAGAATG CAGCGGACAT CAGCACGGAA GCCGCTGAGG CAGCCAATGA GGCGGCAGAG
GCGGCAAACC AAGCCGTTAG CGACTTGCAG GATGCCGCCC AGCGCGGGGA TTTCGATGGA
GCCGATGGCG CTGACGGGTT CAGCCCGACT GCCACTGTGA CGCAGACCGC CGATGGCGTG
ACCATTACCA TCACCGACAA GAACGGCACT ACGACGGCCG ATGTCTCTAA AGGCGCAAAG
GGAGACAAGG GCGATACCGG CGAGCAGGGT CCGAAAGGCG ACAAGGGTGA CACCGGAGAT
CGTGGACCTC AGGGAATTCA GGGTGAGACC GGACCCAAGG GAGATAAGGG CGATACAGGG
GCTACCGGTG CTCAGGGACC TAAAGGCGAA ACCGGAGAAA CGGGCGCAAC TGGTGCTACA
GGACCAAAGG GGCCAAAGGG TGATACGGGC GCACAGGGGC CGCAGGGCAT CCAGGGTGAA
ACAGGTCCCA AGGGCGAAAC CGGTGCGACT GGAGCAGCCG GCAGCGACGG CGTGAGCTGC
ACGCATTCGT GGAATGGAAC CGTGCTTTCC GTCACGAGCG CATCGGGCAC GAGCTCCGCC
GACCTTGTTG GGCCGCAGGG GCCAACCGGC GCTACAGGAG CTACGGGTCC CGCTGGCGCT
GACGGCACGA CATTCACTCC GCAGAACCCG CTTTCGCTTT CGAACGGCGA ACTATCCGTC
GATTTATCTG CAAACACGGA TACCCAGCAT TTAGCGCCGA CGTTCTGGCA GAAGTGGACT
TATCCGCGCC ACCCGGACAG CGAGGGCCGC TACTGGGTCT CGCTCGGGCA GGACACGAGT
CAGGGATACC GCTCATACGA CATGATCTTC AACGGGCAAT ATCGCGAGCT CGACGTGATA
ACGATGGTGT CGAAGAACAG CGGGTACGCG AGCGCGATCC TGCAGCGCGT GGCCGAGTTC
GGTTTGATGC GCAAGTTCGA GACGACCACC GACTTCGACG TAGACCCGGG CGATACGGAG
GCGGTGACGA TTCCTGCCGG TGACTTCTAC TCGGTGCCCG CCGACTGCGT CTTCCTGAAC
ACCACGACAG GCAACTTGAT GATTAACACC GCCGAAGCGG CTCCCGCCAG CAGAATCGCC
GCGACCTCTC TTACCGTGAC GGGGCTTTGC AACATATACG ACGTGGCTGC TGCATTCACG
GCATCGTCGC CGCTTTCGCT CTCCAACGGC GTGCTCTCGA TTGACCTCTC CGGTTATGCC
GCGCTTACGG GTGCCACATT CACTGGGGCC GTTTCGGGTA TCACACCGAC CGCTGATGCG
AACTTCGCCA CGAAGAAGTA CGTGGACGAC GCCATTGCCG CACTTGACGA CCTCTCGAAC
GAAAGCTTCT GA
 
Protein sequence
MFYPAAMCEA AGVVDASIML SLGDDRYFST RNFNIRVEKV LIDGLEPEDG FTLFVQAIAA 
YENAADISTE AAEAANEAAE AANQAVSDLQ DAAQRGDFDG ADGADGFSPT ATVTQTADGV
TITITDKNGT TTADVSKGAK GDKGDTGEQG PKGDKGDTGD RGPQGIQGET GPKGDKGDTG
ATGAQGPKGE TGETGATGAT GPKGPKGDTG AQGPQGIQGE TGPKGETGAT GAAGSDGVSC
THSWNGTVLS VTSASGTSSA DLVGPQGPTG ATGATGPAGA DGTTFTPQNP LSLSNGELSV
DLSANTDTQH LAPTFWQKWT YPRHPDSEGR YWVSLGQDTS QGYRSYDMIF NGQYRELDVI
TMVSKNSGYA SAILQRVAEF GLMRKFETTT DFDVDPGDTE AVTIPAGDFY SVPADCVFLN
TTTGNLMINT AEAAPASRIA ATSLTVTGLC NIYDVAAAFT ASSPLSLSNG VLSIDLSGYA
ALTGATFTGA VSGITPTADA NFATKKYVDD AIAALDDLSN ESF