Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3473 |
Symbol | |
ID | 8727226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 4205594 |
End bp | 4206631 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | 3-dehydroquinate synthase |
Protein accession | YP_003388280 |
Protein GI | 284038350 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000213931 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGACCG TAACCATTGC CCCACTCGCC GAAAGCCTGC CTGCTTTCCT TGAATCTTAC GATTTTTCAG CCATTGCCGT TATTGCTGAT AACCATACGT TTCGGTTTTG TTATCCCGAA CTGAAAGCCT TTTTGCCTAA ACACACCCTT GTCCGGATCA AATCGGGAGA GGAGCAGAAG CACATCGCTA CCTGCGAAAT GATCTGGGAT GCCCTCACGC GGGCGAATTT CGACCGTCAT GCGCTGGTGC TCAACCTCGG TGGGGGCGTC ATTGGTGATA TGGGTGGTTT TTGCGCGGCT ACTTACAAGC GCGGTATTGC CTTCGCGCAG CTGCCCACTA CCTTACTTTC GCAGGTAGAT GCCAGTGTAG GCGGTAAACT CGGGATTGAT TTCCGGGGCT TCAAGAATCA CATTGGGGTA TTTCAACAAC CTAATACGGT ACTGATTGAC CCAACGTTTC TGTCTACGCT GCCTGAGCGT GAACTCCGGT CAGGGTTTGC CGAAGTCATT AAGCATTGTC TCATTGCTGA TGCTGCGATG TGGGACGAAA TTCGTCGGCG CGACCTCGAC GAGCAGGACT GGGCGGCACT GGTGGCTCAT TCCGTAGCGG TAAAACGACG CGTCGTTGAG CAGGATCCTA CCGAAAAGGG ACTACGGAAG ATTCTGAACT TTGGCCATAC GCTGGGCCAT GCGGTAGAAA CGTATTTCCT GACGCAGCCC CGGAAACGGC TCCTGCATGG TGAAGCCATT GCGATGGGTA TGATTGCCGA AGCTTACATT GCCTATCAGA AAAAAATGAT CGATGAATCG CTGCTCACGC AAATCGAGGA ATACATATTT GCCGTATATG GTAATGTGCG GTTGTCGGAC GAAGATACCG AACCGATTCT GGCCCTGACC CTGCAGGATA AGAAAAATCG GGGGCGTGAA GTACGAATGT CGCTACTGGA TGGAGCCGGG AGTTGTGCCT TCGATATTCT GGTATCAAAC ACCGAGATGC GGAAGGCAAT CGAGTTCTAC CGAGGCCTAA ATAAATAA
|
Protein sequence | MSTVTIAPLA ESLPAFLESY DFSAIAVIAD NHTFRFCYPE LKAFLPKHTL VRIKSGEEQK HIATCEMIWD ALTRANFDRH ALVLNLGGGV IGDMGGFCAA TYKRGIAFAQ LPTTLLSQVD ASVGGKLGID FRGFKNHIGV FQQPNTVLID PTFLSTLPER ELRSGFAEVI KHCLIADAAM WDEIRRRDLD EQDWAALVAH SVAVKRRVVE QDPTEKGLRK ILNFGHTLGH AVETYFLTQP RKRLLHGEAI AMGMIAEAYI AYQKKMIDES LLTQIEEYIF AVYGNVRLSD EDTEPILALT LQDKKNRGRE VRMSLLDGAG SCAFDILVSN TEMRKAIEFY RGLNK
|
| |