Gene Information |
Locus tag | Sterm_0224 |
Symbol | |
ID | 8595720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sebaldella termitidis ATCC 33386 |
Kingdom | Bacteria |
Replicon accession | NC_013517 |
Strand | + |
Start bp | 234477 |
End bp | 235514 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | |
Product | protein of unknown function UPF0118 |
Protein accession | YP_003307040 |
Protein GI | 269118863 |
COG category | |
COG ID | |
TIGRFAM ID | |
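The coordinate and length fields above agree with one another, which a few lines of arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Sterm_0224).
length = gene_length(234477, 235514)
print(length)                  # 1038
print(protein_length(length))  # 345
```

Both results match the Gene Length (1038 bp) and Protein Length (345 aa) fields of this record.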
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000072412 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAATATTG AAAAGACAAA AGATAAGCTG GTTACACTTG TCTTGTGTTT GGTATGTTTA TTTCTTATAA CAAAGGTTCT GCCGTATTTT GTACCGTTTT TAAATGTAAT ACTCAGTGCT TTGATACCGT TTATTCTTGC TTTTGTTATT ACTTATGTTC TTGAGCCGGC AGTGGAGTTT TTGGAGAAAA AACTGAATTT TAAGAGAATG AGTGCTTTTA TGATAGTTTA TTTTGTAGTG ATGTTTGTTT TTATTGCTAT GGTTCTGGCT CTTATACCGG AAGTGGTGAA TCAGTTTAAC AGCATGATAA GCTTTATAAT AAACCATCAG GGAGAAATAC AGCTGAAGGT CTCAAAATAT ATAGAGCACT CTCATATAAA CATATCAGAG ATAGTGTATA AGCTGAAGGA ATGGTTTTTC AGATATATAT TTAGTCTTCT GAATTCTGGA ATTTCGCTTA TAAAAGCATT TTTCAGTATA GTTTTTATGA CTCCGATCTT TTTATTTTTA CTTATGAAGG ATTACCGAAG TCTGAAAATG AAGCTGAAGC TTCGGATACT GGAAGCTGAC AGAAGAGATA TAATAATAAT AATGCGTAAT ATAGACGTGG TTCTGGGAAA ATATGTCAAA GGCAAGCTGA TAGACTGTTT TTTGGTTGGG ACATTAGTGT ATATTATTTT TTCAATATTA GGACTAAAGT TTGCCCTTTT ATTTTCATTT ATAATCGGGG TAACTAATCT GATTCCTTAT GTAGGACCTG TGATTGGCGC AATTCCTGCC TGTTTGTTTG CTCTGCTTCA GTCATTTAAT ATATTTATAG GTGTTCTGAT AGCAATAGTA TTTATACAGA CACTGGAATC AGTATTTCTT GTACCGTATA TAACGAGTAA AACCGTGGAA ATACATGAGA TTACGACCCT TCTTGTTTTG CTTATAGGGG GAAGTCTTTT CGGGATAATC GGTGCCTTGC TGGCAATTCC GGTTTATCTT GTAATAAAAG TAATATATGA ATATTATAAA AATAATAAAG GGGTATAA
Protein sequence | MNIEKTKDKL VTLVLCLVCL FLITKVLPYF VPFLNVILSA LIPFILAFVI TYVLEPAVEF LEKKLNFKRM SAFMIVYFVV MFVFIAMVLA LIPEVVNQFN SMISFIINHQ GEIQLKVSKY IEHSHINISE IVYKLKEWFF RYIFSLLNSG ISLIKAFFSI VFMTPIFLFL LMKDYRSLKM KLKLRILEAD RRDIIIIMRN IDVVLGKYVK GKLIDCFLVG TLVYIIFSIL GLKFALLFSF IIGVTNLIPY VGPVIGAIPA CLFALLQSFN IFIGVLIAIV FIQTLESVFL VPYITSKTVE IHEITTLLVL LIGGSLFGII GALLAIPVYL VIKVIYEYYK NNKGV
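Both sequences above are displayed in 10-character blocks separated by spaces, so the spacing has to be removed before any downstream use. The following is a minimal sketch of that clean-up together with a simple GC-percentage calculation. The helper names are hypothetical, and the rounding convention behind the record's GC content field is not stated, so a computed value may differ slightly from the displayed one.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three blocks of the gene sequence above, as displayed.
head = clean_sequence("ATGAATATTG AAAAGACAAA AGATAAGCTG")
print(len(head))         # 30
print(gc_percent(head))  # GC percentage of these 30 bases only
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field of the record.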