Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_2889 |
Symbol | |
ID | 5710740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 3045531 |
End bp | 3046514 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641268815 |
Product | putative peptidase |
Protein accession | YP_001534223 |
Protein GI | 159045429 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3191] L-aminopeptidase/D-esterase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00143546 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGC GGATGCGGCC CGGTCCGCGC AACCTGATCA CCGATGTGGC GGGGCTGCGC GTGGGTCATG CGCAGGACGC GGGCGTGAAA TCCGGGGTCA CGGTGCTGAC CGCGGACGCC CCCTTCACCG CCGGGGTGCA TGTCATGGGC GGTGCCCCCG GCACGCGGGA GACGGACCTT CTGGCCCCGG ACAAGACGGT GGAGCAGGTG GATGCGCTGG TGCTGTCGGG CGGCTCGGCC CTGGGGCTCG ACGCGGCCTC GGGCGTGGCG GATGCCCTGC GCGCGGCGGG GCGCGGGTTC GACGTGGGCG GGCAGCGCGT GCCGATCGTG CCCGCTGCGA TCCTCTTCGA TCTGCTCAAT GGCGGCGACA AGGACTGGAT GGTGAACCCC TATAACTGGC TGGGGCGCGA GGCGCTGGAA GCCGCAGCAC CCGAGTTCGC GCTGGGCACC GTCGGGGCCG GGACCGGGGC CTTGACCGCG ACGCTGAAGG GTGGGCTTGG CTCTGCCTCG GTGGTGCTGC CGGACGGGGT GTGCGTGGGC GCGCTCGTGG CCGTCAATGC GCTGGGCTCG GCGACCATGG GTCCGGGGCG GCATTTCTGG GCGGCCCCGT TCGAGATGGG CACGGAATTC GGCGGGTTGG GCATGGGCGC GGTCGATCCC GCGGCGCTGC CTGCAATCAA GGGCGGGCAC AAGACCGCGA CGACCATCGC CATCGTTGCC ACCGACGCGG CGCTGAGCCA GGCCGCCTGC ACCCGCATGG CGGCGGCCGC CCATGACGGG ATGGCCCGGG CGCTGGTCCC GTCGCACACG CCCATGGACG GCGATCTGGT CTTTGCGGCC TCCCACGGGG ACAAGGCGGG CGATCCGCTG ATGATCGGCC ATGCGGCGGC CCTGTGCCTC GCCCGTGCCA TCGCCCGTGG GGTGTACGCC GCGACGCCGG AGGCGGGCGA CCTGCTGCCC TGCTGGTCGG ACCTGCCGGG TTAG
|
Protein sequence | MSARMRPGPR NLITDVAGLR VGHAQDAGVK SGVTVLTADA PFTAGVHVMG GAPGTRETDL LAPDKTVEQV DALVLSGGSA LGLDAASGVA DALRAAGRGF DVGGQRVPIV PAAILFDLLN GGDKDWMVNP YNWLGREALE AAAPEFALGT VGAGTGALTA TLKGGLGSAS VVLPDGVCVG ALVAVNALGS ATMGPGRHFW AAPFEMGTEF GGLGMGAVDP AALPAIKGGH KTATTIAIVA TDAALSQAAC TRMAAAAHDG MARALVPSHT PMDGDLVFAA SHGDKAGDPL MIGHAAALCL ARAIARGVYA ATPEAGDLLP CWSDLPG
|
| |