Gene Information |
Locus tag | Snas_4625 |
Symbol | |
ID | 8885830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 4930347 |
End bp | 4931627 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | CBS domain-containing protein |
Protein accession | YP_003513361 |
Protein GI | 291302083 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
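The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention explicitly. The function name cds_lengths is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and that the last codon is a stop codon,
    which is not translated into an amino acid.
    """
    gene_length_bp = end_bp - start_bp + 1
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length_aa = gene_length_bp // 3 - 1  # drop the stop codon
    return gene_length_bp, protein_length_aa


# Coordinates copied from the record for Snas_4625.
gene_bp, protein_aa = cds_lengths(4930347, 4931627)
print(gene_bp, protein_aa)  # 1281 426
```

Under these assumptions the result agrees with the Gene Length (1281 bp) and Protein Length (426 aa) fields.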
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.373685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00665585 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCTCTT ACTATATGCT GCTCGGCATT GCCGCCGCGT TGGTGGTGAT GGCCGGGCTG GCCGCGATGA CCGACGCCGC ACTGTCGCGC GTCTCCCCCG CAAGGGTCGA AGAACTCGTG CGAGACCGGC AACGGGGAGC CGTCTCGCTC AAGAAAGTCG TCGCCGACAT TCCCCGGTAC ATCAACCTCC TCATGCTGCT GCGGCTGGCC TGCGAGCTCA CCGCGACCAC ACTGGTCGCC GTCGCCGCCT TCCGCGCCTG GGACATCGGC TGGGCCGCCA CCGCCATCAC GGCGGCCGCC ATGACCATCG TCAGCTTCGT CCTCATCGGA GTCGGCCCCC GCACCCTCGG CCGCCAACAC GCCTACCCCG TCGCCCTGGC CAGCGCGGGC ATCGTCCACT GGCTCGGCCG CGCCCTGGGC CCGCTCCCCA AACTGCTCAT CCTCATCGGT AACGCCGTCA CCCCGGGCAA AGGCTTCCGC GAAGGTCCAT TCGCGACCCA GACCGAACTG CGCGAACTCG TCGACATCGC CGAACAGCGC GGCGAGGTCG AACACGGCGA ACGCGAGATG ATCAACTCGG TCTTCGCCCT GGGCAACACG ATCGCCCGCG AAGTGATGGT GCCCCGCACC GAGACCGTAT GGGTCGAGTC CAGCAAGTCC GCCAAACAGG CCCTCGCGCT GGCGCTGCGC TCCGGCTTCT CCCGCATCCC GGTCATCGGC GACGACGTCG ACGACGTCTC CGGCGTTGTC TACCTCAAGG ACCTGGTCCG CCTCACCGGC GAAGGCGCCG ACCCCAAGGT CTCCGAGGTC ATGCGCGACG TCACCTTCGT CCCCGAATCC AAACCCGTCG ACGACCTGCT CCGCGAGATG CAGGCGGCCC GCATCCACAT CGGAGTGGTC GTCGACGAGT ACGGCGGCAC CGCCGGCGTC ATCACCATCG AGGACATCCT CGAGGAGATC GTCGGCGAGA TCACCGACGA ATACGACGTG GAACGCCCCA CGGTCGAAGC CCTCCCCGAC GGCGCCCACC GCGTCACGGC CCGGATGTCC ATTGAAGACC TGTCCGAACT GTGCGGCGTC GACATCGAAC CCGGCGACGT CGAAACCGTC GCGGGCCTGC TGGCCCAAGC CCTGGGCAAG GTCCCCATCC CCGGCGCCCA CGCCACCATC CACGGCCTCG ACCTCACCGC CGAAGGCACC GCGGGCCGTC GCAACCGCAT CGACACCGTC GTCGTCCGCC GCATCGCCGA CCAACCCGAC GAAGAGGAAG AGAACCACTG A
|
Protein sequence | MSSYYMLLGI AAALVVMAGL AAMTDAALSR VSPARVEELV RDRQRGAVSL KKVVADIPRY INLLMLLRLA CELTATTLVA VAAFRAWDIG WAATAITAAA MTIVSFVLIG VGPRTLGRQH AYPVALASAG IVHWLGRALG PLPKLLILIG NAVTPGKGFR EGPFATQTEL RELVDIAEQR GEVEHGEREM INSVFALGNT IAREVMVPRT ETVWVESSKS AKQALALALR SGFSRIPVIG DDVDDVSGVV YLKDLVRLTG EGADPKVSEV MRDVTFVPES KPVDDLLREM QAARIHIGVV VDEYGGTAGV ITIEDILEEI VGEITDEYDV ERPTVEALPD GAHRVTARMS IEDLSELCGV DIEPGDVETV AGLLAQALGK VPIPGAHATI HGLDLTAEGT AGRRNRIDTV VVRRIADQPD EEEENH
|
| |
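The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how they might be cleaned and checked follows; the helper names (clean_sequence, gc_fraction, translate) are hypothetical. The codon table is the standard one: translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    return (dna.count("G") + dna.count("C")) / len(dna) if dna else 0.0


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = clean_sequence("ATGTCCTCTT ACTATATGCT GCTCGGCATT")
print(translate(prefix))  # MSSYYMLLGI
```

The translated prefix matches the first block of the protein sequence. Applying the same functions to the full gene sequence should allow the Protein sequence and GC content fields to be checked in the same way.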