Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sked_33170 |
Symbol | |
ID | 8634950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sanguibacter keddieii DSM 10542 |
Kingdom | Bacteria |
Replicon accession | NC_013521 |
Strand | + |
Start bp | 3693113 |
End bp | 3694768 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | carbohydrate-binding protein |
Protein accession | YP_003316046 |
Protein GI | 269796591 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0165973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGAA CACTCCCGGC ACGACCACGA CGCGCGACGG CCGCAGCCGT CGCCCTGGGC ATGGTGGCCC TCGCCGCCTG CGGCAGCAGC GGCTCCGCCG ACGGCGACTC CGAGGCCGCG AAGGGCAACG ACGCCGGAGC GATGGCCGAC TACGCCGTCG GCGACCAGTT CACCGCGACC GAGCCGCTGT CGTTCTCGGT GCTCTACAGC GACCACCCCA ACTACCCGAT CCAGGACGAC TGGCTGCTGT GGTCCGAGCT CGAGGAGCGC ACCGGCGTCA CCCTCGAGCC GACCGTGGTC CCGATGAGCG ACTACGAGCA GAAGCGCAGC CTGCTGGTCG GCGCGGGCGA CGCGCCGCTG ATCCTGCCCA AGACGTACCC GGGCCAGGAG GCCGCCTTCG TCGCCTCCGG CGCGATCCTG CCCGTGAGCG ACTACGTCGA CCTGATGCCG AACTACCAGG CGAAGGTCGA GGCCTGGGGT CTCGAGCCGA ACCTCGACAC GCTGCGCCAG GAGGACGGCA AGTACTACGT GCTCCCCGGG CTGCACGAGG CCGTGTGGCA GGACTACACG ATCGCGGTGC GGACCGACAT CATGGACGAG CTCGGGCTCG AGGAGCCAGC GGACTGGGAC GAGTTCCGCG ACATGCTCGC CGCCATGAAG GAGGCCTACC CGGACGTCTA CCCGCTGTCC GAGCGCTTCA GCATCCCGAC CCCCGCCGGC AACCTGCTCA ACCTGCTCGG CATGACCTAC GGGACCTCCG CCGGCTGGGG CTACAACAAC CAGCAGTGGG ACCCCGAGAC CGAGGAGTTC GTGTTCCCGG GCACCACCGA CGAGTACCGC GACATGCTCG AGTACCTGCA CGGGCTCGTC GAGGACGGCC TCATGGACCC GGAGAGCTTC ACTCAGGACG ACGACACCGC CATCTCCAAG CTCGCCAACG GGCGGTCCTT CGCGATCAGC ACCAACGCCC AGACCCTCGT CAACGACTAC CGCCCGGCCC TCTCGGCGAC ACTCCCCGAC GCGGAGATCG CCAAGATCCG GCTGCCTGCC GGACCGGCTG GTGACGTGGT CAGCGGCACC TCGCTGCTCG AGAACGGCAT CATGATCAGC GCCGCCGCGG CCGACGACGA GAACTTCGTC GCGATGATGC AGCTCATCGA CTGGCTCTGG TACTCCGACG AGGGCCAGGA GCTCGCCAAG TGGGGCGTCG AGGGCACCAC CTACACCAGG GACGCCGACG GCACCCGCGT GCTCGACCCC GAGATCGACT TCATCGGCCT CAACCCCGGT GCCCCGCAGC ACCTGCAGAA GGACTTCGGG TTCTCGGGCG GCGTGTTCGC GTACGGCGGC ACGACGGACC TGCTGTGGTC GACGTTCTCC GACGAGGAGG TCGCCTTCCA GGAGGGCATG GCCGACAAGG AGGTGCTGCC CCTGCAGCCG CCGTTCCCGC TCGACGAGCT CGAGCGCGAG CAGGCGACCC TGCTCGAGAC CCCGCTGCGC GACACCGTGC AGCAGGCCTC CCTGCAGTTC GTCCTCGGTC AGCGCGACCT CGCCGACTGG GACGCCTACG TCGCCGAGAT CGAGGCGAAG GGGTCGACGC AGTACATCGA CCTCGTGAAC TCCGCCCACG AGCGCTACGT CGCCGAGAAC GGCTGA
|
Protein sequence | MTRTLPARPR RATAAAVALG MVALAACGSS GSADGDSEAA KGNDAGAMAD YAVGDQFTAT EPLSFSVLYS DHPNYPIQDD WLLWSELEER TGVTLEPTVV PMSDYEQKRS LLVGAGDAPL ILPKTYPGQE AAFVASGAIL PVSDYVDLMP NYQAKVEAWG LEPNLDTLRQ EDGKYYVLPG LHEAVWQDYT IAVRTDIMDE LGLEEPADWD EFRDMLAAMK EAYPDVYPLS ERFSIPTPAG NLLNLLGMTY GTSAGWGYNN QQWDPETEEF VFPGTTDEYR DMLEYLHGLV EDGLMDPESF TQDDDTAISK LANGRSFAIS TNAQTLVNDY RPALSATLPD AEIAKIRLPA GPAGDVVSGT SLLENGIMIS AAAADDENFV AMMQLIDWLW YSDEGQELAK WGVEGTTYTR DADGTRVLDP EIDFIGLNPG APQHLQKDFG FSGGVFAYGG TTDLLWSTFS DEEVAFQEGM ADKEVLPLQP PFPLDELERE QATLLETPLR DTVQQASLQF VLGQRDLADW DAYVAEIEAK GSTQYIDLVN SAHERYVAEN G
|
| |