Gene Sde_1139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1139 
Symbol 
ID3968326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1475694 
End bp1476914 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content47% 
IMG OID637920210 
ProductMSHA biogenesis protein MshG 
Protein accessionYP_526613 
Protein GI90020786 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAAGT TTAAATATTC CGGCCGCTCG AAGCAGGGGC AGGTATTAAC CGGCGAAATG 
GAAGCTGCAA CGGTTGATGC GGTGGCGTCG GCACTTATTG GGCGCGGCAT TACACCAGTA
AAAATCGAAC CCTTTTCTGC TGCCTCATCG TATATGCGGC AGCTAAATAG CGCGCTGGGT
GGCGATAAGG TGGGCACTAA CGACCTGATT ATGTTTTGTC GCCAAATGTA CACCATTACC
AAATCGGGTA TTCCACTAAC CCGAGGTATT CGCGGCTTGG GCGCAAGTAT TCGCCACGAG
CACTTTAGAG ATGTACTCGG CGATGTAGCT GAACGTTTAG AGGCTGGTGT AGGCTTGTCG
CAAGCAATGC GTCATCATCC TAAAGTGTTT AATAGTTTAT TTGTAAGCAT GGTTGCTGTG
GGCGAAACCA GCGGCAACCT CGACGAAATA TTTCGCCAAA TAGGTTTTTA CTTAGAGCGC
GACGAAGAAA CACGTAAACG TATTAAGCAA GCAACGCGTT ACCCAACGTT TGTAAGCATC
GCTATTGTGC TGGCAATGGC TGCGGTAAAT ATTTGGGTTG TGCCAGCATT TGCGGATATG
TTTGCCAAAT TTGATGCAGA CCTGCCAATT GTGACCCGTA TTTTAATTTT TACCTCTAAT
GCATTTGTAA ATTATTGGTT ACTTATGTTG GTTGTTGTAG GTGGTATGGT AGGTGGTGCT
TACTATTATT TGAATACGCC AGAGGGAGCG TTGCAGTGGG GTAAAAAGCG GTTAAAAATG
CCGTTGGTTG GCGAGCTAAT CGAGCGCGCT ACCATGGCCC GTTATGCGCG TAGTTTTGGT
TTGATGTTGC GCGCAGGTGT TCCGGTGAAC CAGGCCTTGG CTCTGTGCGC AGCAGCAATC
GACAACCCCT ATATCGCCGC AAAAATACAG CAAATTAGAC AAAGCATTGA GCGCGGTGAA
AGCTTATTGC GTACTCATCT TCAAGCGGAA ATGTTTACAC CACTGGTTTT GCAAATGATA
GCCGTAGGCG AAGAGAGCGG CCAAGTAGAG GCGCTGCTCA CCGAAGTAGC GGAATTCTAC
GAGCGTGAAG TGGACTACGA CTTAAAAACG CTTACCGATC GTATTGAACC TATATTAATT
ATTGTTATGG CGGCGTTTGT GGCTCTGTTA GCTGTAGGCA TTTTTCTCCC AATGTGGAGC
ATGTACGAAG TACAGGCGTA A
 
Protein sequence
MSKFKYSGRS KQGQVLTGEM EAATVDAVAS ALIGRGITPV KIEPFSAASS YMRQLNSALG 
GDKVGTNDLI MFCRQMYTIT KSGIPLTRGI RGLGASIRHE HFRDVLGDVA ERLEAGVGLS
QAMRHHPKVF NSLFVSMVAV GETSGNLDEI FRQIGFYLER DEETRKRIKQ ATRYPTFVSI
AIVLAMAAVN IWVVPAFADM FAKFDADLPI VTRILIFTSN AFVNYWLLML VVVGGMVGGA
YYYLNTPEGA LQWGKKRLKM PLVGELIERA TMARYARSFG LMLRAGVPVN QALALCAAAI
DNPYIAAKIQ QIRQSIERGE SLLRTHLQAE MFTPLVLQMI AVGEESGQVE ALLTEVAEFY
EREVDYDLKT LTDRIEPILI IVMAAFVALL AVGIFLPMWS MYEVQA