Gene Shel_21030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShel_21030 
Symbol 
ID8395992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSlackia heliotrinireducens DSM 20476 
KingdomBacteria 
Replicon accessionNC_013165 
Strand
Start bp2334057 
End bp2335607 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content61% 
IMG OID644986852 
ProductRHS repeat protein 
Protein accessionYP_003144463 
Protein GI257064791 
COG category 
COG ID 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCAGTA CTGGTCTCCG GTCGAAGTTG CGTACGGCCC TCGCCGCCGC TCTGGCACTT 
TCGGCCGTGG CGGCGGTGGT CGCCGTCGTC TTGGCGGGAT GGGAGGCTTC GGGTTCCGAC
TACGCCCTGA CCAGCGAGGA GTTTCCCGAT AACCGGGTGC TGGCCGTCAT GGAGTCTTTC
GACACCGACG GTGACGGCGG GCTTTCGCGT GAAGAGGCCG CCGCCGTGAC CGAGGTGTCC
GTCGAAGGGA CGGAGGACCT GGCGTTTCTT GCTCTGCTTC CCAATGTGAC CACGCTTGCC
GTGTCGGGCA CCAACATCGA GTTCTTGAAT CTGAGCGGCG CGAACAACCT TGAGCGCCTG
ACGCTGGATT GTCCGGCGCT TACGGAAGTG ACCTGGGGAA AGACCGGTCA GCTGGCATAT
GTGGACGTGT CGGGCACGAA ACTTGCAGAA CTGGACGCTG CCGTGCTTCA GGCGGCGGCG
TATCTGGATG TGACCGGGTG CGAGAATCTT GCGGAGCTGA ACCTTTCCGC CAACAGCGCT
TTGGAGACGC TGCGGGCGAT GGACACGGCT TTGACCGAAG TCGATGTGAG CGGGGCCGCG
GGGCTTTCCG CAATCGAAGT TGACGACGAC GTAGCGGTGG TAGGGCTTGA GGCCACGCCG
ATTCACGAGC AGTGGGTGCC TGTCCACGTG ACGAAGGTCG TGTCGTCTTC TGGTGGAACC
TCTACATACG AGTACTCGTT GAGTTTGGAC GAGGCTGGGG TCGCCACGGG GTATGCGGAA
AGCACCGTGG GCGTGGACGA CACTTCTGAA CAGGCAGTTT ACGAGTTCAA CTACGATGAA
TCCGGCAACA TCATAGGCAT TTGGGATGTC GAAGCCGAAT CTATGGACGT GGAGTTCGTC
TACGACGATG CGGGCAACGT CATTCGCCGC ACGGGTGTGC CCACGGAATA TTCCTACACC
TACGACGATC AGGGGCGGTT GGAATCCTTC GTCTCGACAG GCACCGTCCT CACCCTCGCC
TACGACGAAG CGGGCCGGCT TGCGTCCTAC ACCAGCAGTT TCGGCGGACG GGCGGTTGAA
TACACCTACG CCTACGACGC GGAGGGCAGG GTGACAGGCG TGAGCGATTC GTCTGACCGG
CCGATGGAAT ATACCGTGGA CTACGACGAT AACGGCGCGT GCTCGTCCAT CCTGATCGAA
GGTGCCGTTG GAAGCGAAAC CTACCGGTTC GTTCGCGATG AGGACGGCCG CCTGACGAAC
ATGACGATGG AGACCACGGG CGACTTCAAA CAGCTCGGCA ATGTGGACGG CGTTCAATTC
GAATACAACG CCGCAGGTCA GATCAGCGGA TTTACCTACG AGAACTGGAA CACCGAGTAC
ACCTTCGCCG TGGAGTACGA GCGGTTCTTC CTGGGTCAGG GCGAATGCGT GCAGGCGACG
CTGGGCAGCT TCGCCAATCC GCTGTGGTGG ACGTCGGAGG GTCTGACCTG GATGAACCCC
GAACGATGCG TCCGCACCGA GGGCGTCATT TCTGCGAGTA TGGCACGCTA G
 
Protein sequence
MPSTGLRSKL RTALAAALAL SAVAAVVAVV LAGWEASGSD YALTSEEFPD NRVLAVMESF 
DTDGDGGLSR EEAAAVTEVS VEGTEDLAFL ALLPNVTTLA VSGTNIEFLN LSGANNLERL
TLDCPALTEV TWGKTGQLAY VDVSGTKLAE LDAAVLQAAA YLDVTGCENL AELNLSANSA
LETLRAMDTA LTEVDVSGAA GLSAIEVDDD VAVVGLEATP IHEQWVPVHV TKVVSSSGGT
STYEYSLSLD EAGVATGYAE STVGVDDTSE QAVYEFNYDE SGNIIGIWDV EAESMDVEFV
YDDAGNVIRR TGVPTEYSYT YDDQGRLESF VSTGTVLTLA YDEAGRLASY TSSFGGRAVE
YTYAYDAEGR VTGVSDSSDR PMEYTVDYDD NGACSSILIE GAVGSETYRF VRDEDGRLTN
MTMETTGDFK QLGNVDGVQF EYNAAGQISG FTYENWNTEY TFAVEYERFF LGQGECVQAT
LGSFANPLWW TSEGLTWMNP ERCVRTEGVI SASMAR