Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3814 |
Symbol | |
ID | 8727572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 4582641 |
End bp | 4583720 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | |
Product | NHL repeat containing protein |
Protein accession | YP_003388605 |
Protein GI | 284038675 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000186749 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAT TACCTATCCA CAGGGTTGTA GTTTACGTTT TGATAGCGTT TAGCTTTGCT TGCAAAAAAG AACCTTCCGT TGAAGTTGCC AAAGAATTAA AGCCAACTAC CGTCAGTACC TTTCTGTCTC CTAGCCAGTT AAACAACTCC AGACCCGTCT CTGTAGCCTT TGATAAAGCG AATAACCTAT ACTTCGCAGA TGGAATTGCC CGTATTTTCA AAGTCACTCC ACAAGGCAAT CTGAGCTTAT ACGCCGGTTC CGGCGGAACC GGCTACCAGG ACGGTTCACT CGACAAAGCT AAGTTTCTGT GGCCTTATGG TTTGGCTTTT GATCGCGCAG GTAATCTGTA TGTGGCTGAT TCGGGTAACC AGGCAATTCG AAAGATTAGT CCAGATGGTC AGGTTACTAC GTTTGCGGGT CAACCCTATG ATGTGACGAG CATAACTAAT GTGAGTGTCG ATGGTATTGG CAAAGAAGCT CGGTTCTATA ATCCACTCGT TCTGACCATC GATCGTTCTG ACAATTTGTT TGTTGGGGAA TATGCGGATG CTAGTGCTTA CGTTGCTGTT ATTCGAAAAA TCACACCCCA AGCGAACGTG ACTTCTTATG CAGGTACGGC CGGCAAAATG ACTTTGCAAC AATACAGCAA CCCGTCTACC CTTATTTCCC CCCGAGGGTT AGCGGTCAAT TCAAAGGGAG AGCTCTTCAT TGGTTGCCCA GGCGTTATCT ATAAAGTCAG CACAAGTGGG CAAACGACTG TTTACTCTGG GGTTCGGGAG CAGTATGGCA GTAGCCCAAA CGGCCCGATT AACTCCGCTC GCTATGGCCT GATCACCAGC CTTCGATTCG ATTCAAACGA CAATTTATAT ATCGGTGAAA CAGGAGCCGG CATCGTCCGT AAAATAGCAA CAGATGGACA AGTAAGCGAC GTGACGGGTA GTCGACTTGG TGGTTACAAA GACGGCCCCT TACAAGCCGC CGAATTCGGT TCGGTCGAAG ACTTGGCCTT CTCTCCATCC GGCTCGCTTT ATGTGGCTGA TAATCGAAAT GGAGCGATTC GTAAAATCAC CTTTGAGTAA
|
Protein sequence | MKKLPIHRVV VYVLIAFSFA CKKEPSVEVA KELKPTTVST FLSPSQLNNS RPVSVAFDKA NNLYFADGIA RIFKVTPQGN LSLYAGSGGT GYQDGSLDKA KFLWPYGLAF DRAGNLYVAD SGNQAIRKIS PDGQVTTFAG QPYDVTSITN VSVDGIGKEA RFYNPLVLTI DRSDNLFVGE YADASAYVAV IRKITPQANV TSYAGTAGKM TLQQYSNPST LISPRGLAVN SKGELFIGCP GVIYKVSTSG QTTVYSGVRE QYGSSPNGPI NSARYGLITS LRFDSNDNLY IGETGAGIVR KIATDGQVSD VTGSRLGGYK DGPLQAAEFG SVEDLAFSPS GSLYVADNRN GAIRKITFE
|
| |