Gene Sde_0140 details


Gene Information

Locus tag: Sde_0140
Symbol:
ID: 3965370
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 172389
End bp: 173915
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 45%
IMG OID: 637919199
Product: hypothetical protein
Protein accession: YP_525616
Protein GI: 90019789
COG category:
COG ID:
TIGRFAM ID: [TIGR02602] eight transmembrane protein EpsH (proposed exosortase)
[TIGR02914] EpsI family protein
[TIGR03109] exosortase 1
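
The start, end, and length fields above agree with one another if the coordinates are read as 1-based and inclusive, and if the gene length counts the terminal stop codon while the protein length does not. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with illustrative function names that belong to no database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(172389, 173915)
print(length, protein_length(length))  # 1527 508
```

Under these assumptions the computed values match the listed Gene Length (1527 bp) and Protein Length (508 aa).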


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATATTC GTGCATTAAC TAGCGAGATT AAATCCAAGG CTTGGGTGCT TGTTACTTTG 
CTAGGGGTGG TTGCTTTTGC AGTATTGGCC TATTGGCCTA CTTGGCAGTC TCTACATAAT
GTATGGGCTA TGCTCGATCA AAGCTATTCC CACGGCTATC TTATTGCTCT TGTTGTTCTT
TATTGGTTAT TTAAGGCTGT ATTGCAATAT CAAGCCTCGC CCAAACCTAG CTATGTAGCT
GTGTTAGGCA TTATAGCGTT AGGGGGCTTG TGGCTGTTTG GTGAGGCCAC TCAAACATTG
CTGTTGGCTC AGCTGGCGCT GCCGTGTCTG CTGTGGTTTT CCATTTTTGC CATATTAGGC
CTGCGTTTTG CGGTCAGCGT TGTGCCATTG TTGCTTATAT TTATATTCGC TATACCCCTG
TGGGATATTC TTACGCCAAT ACTGAGGGGT ATTACCACAT ATGTAGTCCA ATTTGGCGTG
TCGATACTGG GAATACCGGC ATACTTCGAT GGCTTTAAAA TAGAAGTGCC GGCTGGTGTG
ATCGAAATTG CATCAAGTTG CGCGGGCTTA AACTACTTTC TTATGGCTAA TGCCTTTGCA
GCCATTTATA GCTATCAGAA CCAATTAGAT ACAAAGCGTA CAATTCTGTG TGCTCTAGTT
GCAACAGCCA TAGCGTTGGT GGGTAACTGG GTACGTGTCT ATATACTTGT TTTAGTGGGG
TATTATTCCA ACATGACCCA CTCATTAATG CACTCGCACG CAAATTTCGG TTGGATACTG
TTTGGCTTAT GCATGGTGCC AATGTTATAT GTATTTGGCC GTATATACGC GACAAGCACC
TGGGCCCAAA GCGCAGCTGA TGAGCCTACA CCGCCCAAAA CCCAAGCAGA CCTTTCTCCA
AAAACGTTTA TTATTAGCGT GTTGCTTGTG TGCTTTTGCT TGGCTTTGGC GCCTGCCACG
CTGCACTGGA ATAGCGTTAA AGGAGATGTG GCAAGCTATG AAGTTTCTAC CATTAACGGT
GCCAAGCTAA GTCAGTATTC TGCTGTTGCA TGGCGGCCCG CTTTTAAAGG GTATGATGCA
CATTATTCTT GGTTGGGGCC TGTAGCTGGT TTGCAGTCGC AACTACAAGT TCTGTTGTAT
ACGCAGCAAG CGCAGGGTAA AGAGCTTATT TATTACGACA ATTTGCTTGC GCCAGCGCAC
AACTTGAAAA AATTAGGCGA CTTCAATGCA AATGGTAATA TGCCGTTAGC AATGGCCGCT
GTAAACTCAG GTGGTCAACA GCGGCTATTA ATTTGGTCCT ACAACTTGGG CGGTAAATAT
ACTACCTCGC CTATCAGCGC AAAACTACTG CAATTTTTAA GCTTGTTTAA TCAGCAACCC
TATGCCGCTC TCGTGGTACT GGTATTTGAC TGCCAGACCA ATTGCACACA GGAGCAGGCA
GTAATAAATG CCTCACCAAG TGAAATAGAT GTAATTTTTT CACAACTAAA CATCAAGGCC
CTAGCGCCCT CAAATAAATG GAAATAG
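
A figure such as the GC content in the gene information can in principle be recomputed from a sequence printed in this 10-base block layout. The sketch below is one way to do that in Python, assuming GC content means the percentage of G and C bases among all bases. The function name `gc_content` is illustrative, and the example uses only the first two blocks of the gene sequence, so its result is not expected to equal the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence (7 of 20 bases are G or C).
print(round(gc_content("ATGCATATTC GTGCATTAAC")))  # 35
```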
 
Protein sequence
MHIRALTSEI KSKAWVLVTL LGVVAFAVLA YWPTWQSLHN VWAMLDQSYS HGYLIALVVL 
YWLFKAVLQY QASPKPSYVA VLGIIALGGL WLFGEATQTL LLAQLALPCL LWFSIFAILG
LRFAVSVVPL LLIFIFAIPL WDILTPILRG ITTYVVQFGV SILGIPAYFD GFKIEVPAGV
IEIASSCAGL NYFLMANAFA AIYSYQNQLD TKRTILCALV ATAIALVGNW VRVYILVLVG
YYSNMTHSLM HSHANFGWIL FGLCMVPMLY VFGRIYATST WAQSAADEPT PPKTQADLSP
KTFIISVLLV CFCLALAPAT LHWNSVKGDV ASYEVSTING AKLSQYSAVA WRPAFKGYDA
HYSWLGPVAG LQSQLQVLLY TQQAQGKELI YYDNLLAPAH NLKKLGDFNA NGNMPLAMAA
VNSGGQQRLL IWSYNLGGKY TTSPISAKLL QFLSLFNQQP YAALVVLVFD CQTNCTQEQA
VINASPSEID VIFSQLNIKA LAPSNKWK
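
The protein sequence can be derived from the gene sequence by codon translation. The sketch below is a minimal Python version with no external libraries. It assumes the codon assignments of NCBI translation table 11, which are the same as the standard code for all 64 codons, and it does not handle alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGCATATTC GTGCATTAAC TAGCGAGATT"))  # MHIRALTSEI
```

The output matches the first ten residues of the protein sequence above. Translating the last four codons of the gene, `AAATGGAAATAG`, gives `KWK`, which matches the protein's C-terminus.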