Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_5235 |
Symbol | |
ID | 8729001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 6390051 |
End bp | 6391328 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | |
Product | protein of unknown function DUF349 |
Protein accession | YP_003390005 |
Protein GI | 284040075 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.809819 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAACG CTTCACTGGT AGATGAATAC GGTTACGTCA AAGACGGAAA GGTATTTTTG AAAGGCTACC TGAATTACGA AGACCGCCAG ATTGGCGAAG TCAAACGCAC CGAGCAGGAA GCGCTCGATT ATTTCAAGAA TCGCTTCATC ATCGCGGAGA ACAAAGTCAG CCAGCTAGAA AAAGATATTG AAGAGGCTCA GAACAAAGGC TCTTATCTAA CAAAACTGGT TCAGCTTCGT AAGAAGCTCC TCGGTTTTGA CGCACTGGGC GATTTCCCTC CCCTGCTTGA GCGCCTGGAC GAGCAGGAAA AACTGCTGGC TGACCTGATT ACGGTCAATC AGCTGAAAAA CCTTGAAATA AAACGAGCCC TGATCGCCGA AGCTGAAGCT CTGGCCGATA GTACCGACTG GCGTAACACA GCCGATGCCC TTCAGGAAAT CAAGGTGAAA TGGATCAAAA CGGGTCCTGT CGACAAAGCG ACGGAGGCCG AAGTGGAAGG CCGGTTTCAG GAACTTCTGG ATGGCTTCTT TACCCGCCGA CGTGAGTTCT TCAACGAACA GAACAAGGTT ATTCAGGAGC GGCTTGAAAA ATACGATGAA CTGATTCGAC TTGCCTTCCG GGCCAACCGC CTGGGCGATC TGGATGCAGC TTTTCAGGAA GTACGCAAAT TAAACAATGC CTGGAAACAG GTGGGTGAGG TGCCCATCAA AAAGAGCGGC AAGCTCTACA AGCAGTTTAA GAAGGCGACG ACCATGTTCT ACGCCAAATA CAACGACGCG AAGGGCATTG TAATCGTACC GAAGATCGAT CCTCGCATCG AGCAGCAGAT GAAAATGGCC GACGAAGCCG AAAAGCTCTC GAAACAGTCT GATATTTTTG CCGCTGCCGA ACGGGCCAAA GTGCTACTCA ATAGCTGGAA GGAAATTCGG GTGCCATTTA AGCTTCAGGA TAAAGTCGTT AATGAACGCT TCCGGGCAGC CTGCGACAAG ATTTTCGAAC TTAGTTATCT GGGACGGGTA TTGACGCGTA AATACCCGGC GTTTGAACTC AAAAGCCAGT CGGAACAGAT TCGGACGAAG ATTCGGGAGA TGGAGTACCT GGTCAAGCGG GAGAAGAACG ACCTTCAGTT TGCTTTACAG GACGCCGACG GTCTCGACCC GAACAAGGAC GAAGACAAGC AAATCCTGAA CAAGATCAAC ACCCAGAAGC GCAAAATCGC CATGAAAGAG ACTATTCTTC GTGAGTTTCA GAAGCAACTG GAAACAGCAG GCTATTAG
|
Protein sequence | MENASLVDEY GYVKDGKVFL KGYLNYEDRQ IGEVKRTEQE ALDYFKNRFI IAENKVSQLE KDIEEAQNKG SYLTKLVQLR KKLLGFDALG DFPPLLERLD EQEKLLADLI TVNQLKNLEI KRALIAEAEA LADSTDWRNT ADALQEIKVK WIKTGPVDKA TEAEVEGRFQ ELLDGFFTRR REFFNEQNKV IQERLEKYDE LIRLAFRANR LGDLDAAFQE VRKLNNAWKQ VGEVPIKKSG KLYKQFKKAT TMFYAKYNDA KGIVIVPKID PRIEQQMKMA DEAEKLSKQS DIFAAAERAK VLLNSWKEIR VPFKLQDKVV NERFRAACDK IFELSYLGRV LTRKYPAFEL KSQSEQIRTK IREMEYLVKR EKNDLQFALQ DADGLDPNKD EDKQILNKIN TQKRKIAMKE TILREFQKQL ETAGY
|
| |