Gene Information |
Locus tag | Mmwyl1_1347 |
Symbol | |
ID | 5364683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Marinomonas sp. MWYL1 |
Kingdom | Bacteria |
Replicon accession | NC_009654 |
Strand | + |
Start bp | 1507904 |
End bp | 1509121 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640803691 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_001340211 |
Protein GI | 152995376 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
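The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions fit the numbers in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(1507904, 1509121)
print(length)                      # 1218
print(protein_length_aa(length))   # 405
```

Both results match the Gene Length and Protein Length fields, which suggests the coordinates follow the inclusive convention assumed here.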
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.969925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000000139909 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTTTTG ATGTAGCGGC TATCCGCGAG CAATTTCCGA TTTTAAAGCG CGTAATTGAT GGTAATCCGC TTATTTATCT CGATAATGCG GCGACAACGC AAAAGCCGCA ATGTGTGATT GATGCTTTGG TGGATTACTA CACGACATGC AATTCCAATG TGCATCGTGG TGCCCATCGT CTTGCGGATG AAGCAACGCG TCGTTTTGAA GACGCACGCG ACATCGTAAA AGACTTTATT AATGCGCCGA AACGCGAAGA AGTCATTTGG ACTACGGGTA CTACAGAAGC GATTAATATT GTCGCCAATG GTTTGGGTTC TTTGTTGTTA CCGGGTGATG AAGTTATTGC AACTGGAATG GATCACCATG CCAATCTAGT GACATGGCAG CAAGCATGCA AAGCCTCTGG TGCGACTCTA AGAACCATTC CAGTGACGGA TGCTGGTGAA TTAGATCAAG CTGCTTATGA TGCCATGCTA AATGAGCATA CTAAGTTTGT TGCTTTTCCG CATGTTTCAA ATGCCTTGGG TACCGTGAAC CCAATTAAAG AAATGACGGC GAAAGCAAAG AAGTTTGGTG CTTGGGTTTT AGTGGATGGT GCTCAAGGTG CCGCTCATGG TCATGTAGAT GTGCAGGACA TAGGTTGTGA TTTTTATGCC TTTTCAGGTC ATAAAATTTA TGGGCCGATG GGTGTTGGGG TACTTTGGGG GAAGGAATCA GTATTGTCGA CTTGGCCTGT TTGGCGCACT GGTGGCGAGA TGATTTCTAC TGTTACTTTG CAAGACGCTA CATGGAATGT ATTGCCTTAT CGCTTTGAAG CGGGCACGCC AAATGTGGGT GATGCCATTG CTATGGGTGA GGCGATTCGT TGGTTTAGCG CCTTGGACCA GGAGGCTGTT GCTGCCCACG AAAAAGCCTT GCTGGATCAT GCTACTGAAC TTGCCGAGCA ATTCGAAGGC TTAACCATTA TTGGTACTGC TAAAGAAAAA ATTGGAGTAC TGAGCTTTGT AATGGATCAG GGGCATCCTG CTGATATCGG TTTTTTGTTA GATCGACAGG GGATTGCTGT TCGTACTGGT GATCATTGTG CGCAGCCTTT GATGGCTCGT TTTGGCGTTC CTGGTACAGC GCGAGCATCG TTCGCGATTT ATAATACTTT AGAAGAAGTT GACGCTTTGT TTGTGGCACT GAAAAAAGTG CGAACAATGC TGGCTTAG
|
Protein sequence | MSFDVAAIRE QFPILKRVID GNPLIYLDNA ATTQKPQCVI DALVDYYTTC NSNVHRGAHR LADEATRRFE DARDIVKDFI NAPKREEVIW TTGTTEAINI VANGLGSLLL PGDEVIATGM DHHANLVTWQ QACKASGATL RTIPVTDAGE LDQAAYDAML NEHTKFVAFP HVSNALGTVN PIKEMTAKAK KFGAWVLVDG AQGAAHGHVD VQDIGCDFYA FSGHKIYGPM GVGVLWGKES VLSTWPVWRT GGEMISTVTL QDATWNVLPY RFEAGTPNVG DAIAMGEAIR WFSALDQEAV AAHEKALLDH ATELAEQFEG LTIIGTAKEK IGVLSFVMDQ GHPADIGFLL DRQGIAVRTG DHCAQPLMAR FGVPGTARAS FAIYNTLEEV DALFVALKKV RTMLA
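The sequences above are shown in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch using only the Python standard library; the helper names are hypothetical. The completeness check is a simplification: it accepts only ATG as a start codon, although translation table 11 also permits alternative starts such as GTG and TTG.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole codons, ATG start, and a standard stop codon.

    Simplification: alternative start codons allowed by table 11
    are not accepted here.
    """
    stop_codons = {"TAA", "TAG", "TGA"}
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stop_codons


# First two blocks of the gene sequence above, used as a short example.
prefix = clean_sequence("ATGAGTTTTG ATGTAGCGGC")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.45
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` would give a value to compare against the GC content field, and `looks_like_complete_cds` could be applied to the cleaned full sequence in the same way.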