Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3898 |
Symbol | |
ID | 5714427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009956 |
Strand | - |
Start bp | 121824 |
End bp | 123404 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641276811 |
Product | alpha amylase catalytic region |
Protein accession | YP_001542107 |
Protein GI | 159046436 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.237178 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCGAC GCGGGCCGTG GCCCGAAAAC CCCGTCATTT ATCAGGTCTA CCCCCGGTCG TTCCTTGACA CGACCGGGAC GGGGGAAGGC GATCTGCCGG GGGTGACCCG GCAGCTCGAT TACATTGCCG GCCTCGGGGT GGACGGCATC TGGCTTTCGC CCTTCTATCC CTCGCCGTTC TGCGACGGGG GGTATGACAT TGCCGATCAT TGCGCCGTCG ACCGGCGGTT CGGCACCCTC GACGATTTCG ATGCGCTGGT GGCGCGGGCC CATGATCTGG ACCTGCGTGT GATGATCGAT CTGGTGCTCA ACCACACGTC GGACACCCAT GACTGGTTCG CAAAATCGCT GGCCCGTGAA GAAGGCTTCG AGGATGTCTA CATCTGGGCG GACCCGTGCA AGGACGGCAG CCCGCCCTCG AACTGGCTGT CGTTTTTCGG AGAGGCCGCT TGGCGCTGGC ACCCGCAACG TGCGCAATAC TGCCTGCACA AGTTTCTGCC CTGTCAGCCC TGCCTGAACC ATTACAACGA CCGCGTGCAC GAACGGCTGA ACCGGATCAC GCGGTTCTGG CGCGACCGTG GCGTCGATGG CTTCCGCTAT GACGCGGTAA CGAGCTTTTT CTATGACCCC GGGTTTCGCG ACAATCCCCC CGCGGCCGAG GCCGAAGCGG CTCTGATCCC CGGGCCATCC AACAATCCAT ATACCTTCCA GGAGCATATT CACGACGTGC TGCCCAACGA ATGCGCTGCC TTCGCGGAAA CCCTGCGCGA GATGGCAGGC CCCGACGCCT ACCTTCTGGG GGAGATCAAC AACGGCCCCC GTTCGGTCGA AGTCACGTGC AAGTTCACCG GCCCCGATCG ACTTGACGCC GGCTATGCGA TCGACTTGCC GGAACGCGGG CCCAGCACGG AGGTACTGCG CGACCTTCTC ACCCGGCTGG AGGATGCTGA AGGATGGACC TGGTGGCTCA ACAGCCATGA CCAGAAACGC GCGGTCTCGT CCTTCGGCGA TGGCGGGGCA GCGGATGCGA AGATGCTCGC AGCGTTCCTT TGCGCGCTGC CCGGCCCCCT CTTGCTGTTT CAGGGCGAGG AACTGGGGCA GCCACAGGCA GAGCTCGAAA AGGTCGAGCT GACCGATCCT TATGACCTGA TGTATTGGCC CGACTCGGTG GGTCGCAACG GCGCCCGCGC GCCCATGGCC TGGGACGACA CGCAGCCCGC ATGCGGCTTC AGCAAAGCGG TGCCGTGGCT ACCTATGGCG CGGGCGGAAC AGGGCGGCGT GGCACAGCAG GAGGCCGACC CGGCCTCGGT TCTCGCCTTT TACCGCGATG CACTTGCCCG GCGGCGTGAC CTGGGGCTTG CCGAGGCCAC GATGGAACTC GAAGACGCGC CTGATGCCTG CATTCGGTTC CGGCTGCGCG TTGGGACGCT CGTTGTGCAG GTGGCCGCCA ACATGTCCGG CGCGCCACAA GACCTCGCAC CCAGACAGGG TGCAAAACGG ATCTTGCAGA CCAAGCCCCC CGCGCCGGGC AGCAACCTTC CGCCCCGCAG CGCTGCCTGG TGGCTGTTGG AGAAAGGCTA G
|
Protein sequence | MPRRGPWPEN PVIYQVYPRS FLDTTGTGEG DLPGVTRQLD YIAGLGVDGI WLSPFYPSPF CDGGYDIADH CAVDRRFGTL DDFDALVARA HDLDLRVMID LVLNHTSDTH DWFAKSLARE EGFEDVYIWA DPCKDGSPPS NWLSFFGEAA WRWHPQRAQY CLHKFLPCQP CLNHYNDRVH ERLNRITRFW RDRGVDGFRY DAVTSFFYDP GFRDNPPAAE AEAALIPGPS NNPYTFQEHI HDVLPNECAA FAETLREMAG PDAYLLGEIN NGPRSVEVTC KFTGPDRLDA GYAIDLPERG PSTEVLRDLL TRLEDAEGWT WWLNSHDQKR AVSSFGDGGA ADAKMLAAFL CALPGPLLLF QGEELGQPQA ELEKVELTDP YDLMYWPDSV GRNGARAPMA WDDTQPACGF SKAVPWLPMA RAEQGGVAQQ EADPASVLAF YRDALARRRD LGLAEATMEL EDAPDACIRF RLRVGTLVVQ VAANMSGAPQ DLAPRQGAKR ILQTKPPAPG SNLPPRSAAW WLLEKG
|
| |