Gene Information |
Locus tag | Slin_3988 |
Symbol | |
ID | 8727746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4794933 |
End bp | 4796261 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003388777 |
Protein GI | 284038847 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
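The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 4794933
END_BP = 4796261
GENE_LENGTH_BP = 1329
PROTEIN_LENGTH_AA = 442


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions, 4796261 - 4794933 + 1 = 1329 bp, and 1329 / 3 - 1 = 442 aa, matching the Gene Length and Protein Length rows.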
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.74321 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0290381 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACAAG TAATCGGCAA AGACAAAAAG ACGCGGGTGC GCTACAGCAT GCTGGCACTG GTCTTCATCA ACGTAGTGAT CAACTACCTC GACCGGAGCA ATATTTCGGT GGCGGGGACT GCGCTGAGTA AAGACATGGA CCTGTCGTCG GAACAGTTGG GGTTCATTTT CTCGGCTTTT GGCTGGACCT ACGCCCTGCT ACAAATACCG GGCGGCCTTA TCGCCGACCG CTTTGGTCCC CGCATTCTCT ATGCATTTTG TCTGATTACC TGGTCCTTGG CCACCGTTTG TCAGGGCTTT GTTCGGGGGT TTGCCAGTCT GTTTTCGCTT AGGCTGGCAA CGGGCGCGTT TGAAGCCCCT TCCTACCCCA TCAACAACCG CATTGTTACA AGCTGGTTTC CCGAACACGA ACGGGCTTCG TCTATTGCCT TGTATGTTTC GGGACAGTTT ATCGGCCTTG CGTTTTTAAC ACCCGTACTG ACTTATATCC AGAGTCAGTT CGGGTGGCAG GGTTTGTTCG TGTGTACCGG TATCGTTGGG CTGATCTGGG GCGTTATCTG GTACCTCTTT TACCGCGACC CGCTCGATCA TCCGAAGGTG AACGACGCCG AGCTGGCCTA CATCGAAGAA GGGGGTGGCC TGTTCAGAAG TCGGCAGGCG GGTACGAATA AAGCGTCTGT CTGGAGCTGG GTAAACGTGA AGCAGGTGTT TTCCTCCCGC ACGTTATGGG GAGTTTACAT CGGGCAGTTT GCCGTTAACT CCATGCTCTG GTTCTTCCTG ACCTGGTTCC CCACCTATCT GGTCAAATAC CGGGGGCTGG ATTTCATCAA GTCGGGCTAT CTGGCATCGG TACCTTTTCT GGCGGCCTGT GCGGGTCTGC TCCTCTCCGG CTTCGTCTCC GACAGACTGG TGAAGCAGGG GAAATCGGTA ACGATGGCGC GTAAAGCACC GATCATCATC GGTCTGCTGC TGTCGATCAG TATTGTCGGG GCCAATTACA CGAACGATAC CGCATTGATC ATCGCCTTTA TGGCTTTGGC TTTCTTTGGC TCGGGTATGG CGTTGATCTC CTGGGTGTTC GTATCTATTC TATCACCCAA ACATCTGATT GGTCTAACCG GTGGCGTGTT CAATTTCATG GGCAATCTGG CGTCCATCGT AGTACCTATC GTGATTGGCT ATCTGGCCAA AGACGGTGAT TTCAAACCAG CGCTCGTCTT CGTCGGCGCC CTGGGCCTGA TTGGAGCCTG TTCTTACATA TTCCTGGTGG GCAAAATAGA ACGGGTCGTG ACTCATGACC CGCAGGAAGG GGTCTTTGCG GGGGAGTAA
|
Protein sequence | MEQVIGKDKK TRVRYSMLAL VFINVVINYL DRSNISVAGT ALSKDMDLSS EQLGFIFSAF GWTYALLQIP GGLIADRFGP RILYAFCLIT WSLATVCQGF VRGFASLFSL RLATGAFEAP SYPINNRIVT SWFPEHERAS SIALYVSGQF IGLAFLTPVL TYIQSQFGWQ GLFVCTGIVG LIWGVIWYLF YRDPLDHPKV NDAELAYIEE GGGLFRSRQA GTNKASVWSW VNVKQVFSSR TLWGVYIGQF AVNSMLWFFL TWFPTYLVKY RGLDFIKSGY LASVPFLAAC AGLLLSGFVS DRLVKQGKSV TMARKAPIII GLLLSISIVG ANYTNDTALI IAFMALAFFG SGMALISWVF VSILSPKHLI GLTGGVFNFM GNLASIVVPI VIGYLAKDGD FKPALVFVGA LGLIGACSYI FLVGKIERVV THDPQEGVFA GE
|
| |
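The sequences above are displayed in space-separated blocks of 10 characters. As a minimal sketch of how they might be processed, the code below strips the block spacing, computes GC content, and translates complete codons up to the first stop. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares. The alternative start codons that table 11 permits are not handled, and the function names are illustrative. The short sequences used here are the first and last blocks of the gene sequence, not the full record.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI base order (T, C, A, G).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and any line breaks; upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    head = "ATGGAACAAG TAATCGGCAA"  # first two blocks of the gene sequence
    tail = "GGGGAGTAA"              # final block of the gene sequence
    print(translate(head))          # MEQVIG
    print(translate(tail))          # GE
    print(gc_percent("ATGGAACAAG"))
```

Passing the full Gene sequence field to `gc_percent` and `translate` would give values to compare against the GC content row and the Protein sequence field.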