Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_4666 |
Symbol | |
ID | 8885871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 4976197 |
End bp | 4977564 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Beta-Ala-His dipeptidase |
Protein accession | YP_003513402 |
Protein GI | 291302124 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.976885 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00791318 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTCTAT CTCGCTCTGA TCTTCGCTCG GCCGTCGACG CGGGTATGCC CACCGTGATC GAGGACCTGA AACAGCTGGC CCGCATACCT TCGGTGGCTT TCGAGGGCTT CGACCACTCG CACGTGACGC GCAGCGCCGA GGCGGTGGCC GAGCTGCTGC GAGGCGCCGG CATGGACGAT GTGCGCATTG TCACGGCCAA GGGCGCCTCC GGGCAGACCG GCCAGCCCGC CGTCATCGGA CGCAAGGCCG CCCCGGCGGG CGCGCCGCAC GTGCTGCTGT ACGCGCACCA CGACGTGCAG CCCGCCGGTG ACTACGACGA CTGGGAGCAG GACGACCCGT TCGAGCCGCA GCTGCGCGGC GAGCGGTTGT TCGGGCGCGG CTGCGCCGAC GACAAGGCCG GGGTGATGGC GCACGTCGCC GCGCTGCGGG CCTTCGGCGA CGACCTGCCG GTGGGTGTGA CGGTGTTCGT CGAGGGCGAG GAGGAGTTCG GCTCCGACTC GCTGGAGAAC CTGATCACCG AGAACCGGGA CCTGCTGGCC GCCGACGTGA TCGTCATCGC CGACTCGGCC AACTGGGACG TCGGGCACCC GGCGCTGACC ACGTCGCTGC GCGGCGTCAT CAACGCCTAC GTGGAGGTGC GCACCCTCAA CCAGGCGGTG CACTCGGGCA TGTTCGGCGG CGCGGTTCCC GACGCGCTGA CCGCCCTGTG CCGTCTGCTG GGCACGCTGC ACGACGAGAC CGGCGACGTC GCGGTCGAGG GCCTGAAGAC GGGCACGGCG GCGCCGCTGG ACCTGCCCGA GGAGCGGCTG CGCGCCGAGT CGGGGATGGT CGACGGGGTC GAGTTCATCG GCTCGGGTCG GCTGGTCGAG CGACTGTGGA CGAAGCCGAC GGCGACGGTG CTGGGGATCG ACGCGCCCGG CGCCCGCGAG TCGGCCAACG CGCTGCAGCC GTCGGCGCGC GCCAAGATCA GCCTGCGGCT GGCTCCCGGC GACGAGTCGG TCTCCGCGTT CGAGGCCGTG AAGCGGCACC TGGAGGCGCG GGTACCGTGG GGTGCGCAGC TGACCGTCAC CCTCGACCAC GGCGGCAACC CGTGCCAGAT CGACGCGCGT GGCGAACGCT ACGACGCCGC CCGGGCCGCG TTCGCCGAGG CGTGGGACGG GGTGGAGCCG GTCGACATGG GTGTCGGCGG CGCGATCCCG TTCATCGCGA CGTTCCAGGA GCTGTTCCCG GACGCGGCGA TCCTGGTGAC CGGCGTCGAG GACCCGGACT CGCGGGCCCA CGGCCCCAAC GAGAGCCTGC ACCTGGCGGA GTTCACGCGC GCCTGCCGCG CCGAGGCGCT GCTGCTGCAC AACCTCAGTG AGCTTTAA
|
Protein sequence | MSLSRSDLRS AVDAGMPTVI EDLKQLARIP SVAFEGFDHS HVTRSAEAVA ELLRGAGMDD VRIVTAKGAS GQTGQPAVIG RKAAPAGAPH VLLYAHHDVQ PAGDYDDWEQ DDPFEPQLRG ERLFGRGCAD DKAGVMAHVA ALRAFGDDLP VGVTVFVEGE EEFGSDSLEN LITENRDLLA ADVIVIADSA NWDVGHPALT TSLRGVINAY VEVRTLNQAV HSGMFGGAVP DALTALCRLL GTLHDETGDV AVEGLKTGTA APLDLPEERL RAESGMVDGV EFIGSGRLVE RLWTKPTATV LGIDAPGARE SANALQPSAR AKISLRLAPG DESVSAFEAV KRHLEARVPW GAQLTVTLDH GGNPCQIDAR GERYDAARAA FAEAWDGVEP VDMGVGGAIP FIATFQELFP DAAILVTGVE DPDSRAHGPN ESLHLAEFTR ACRAEALLLH NLSEL
|
| |