Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3738 |
Symbol | |
ID | 5714267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009955 |
Strand | + |
Start bp | 143895 |
End bp | 145256 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641276653 |
Product | transposase IS116/IS110/IS902 family protein |
Protein accession | YP_001541949 |
Protein GI | 159046277 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAAGC GTAAGACCGG CGGCCATCCG GATTTGAACG TGGTCAACCC TAATGCGGCT GCGATCGACA TTGGTTCGAC CATGCATATG GCTGCTGTGA ACCCTGACGC CGCCGCCATG CCGGTGCGGG CTTTCGGGAC GTTTACACAA GACCTGCATG ATCTCGCCGA CTGGCTCCGG TCGTGCGGCG TCACGAGCGT GGCCATGGAG TCGACCGGCG TTTACTGGAT ACCGGCCTTC GAGATCCTGG AGGCGCATGG GTTCGAGGTT ATCCTTGTGA ATGCCCGCTA CGCCAAGAAT GTGCCAGGCC GCAAAACCGA TGTCAGCGAT GCGGGGTGGC TGCGCCAGTT ACATTCTTAT GGGTTGCTCC GCAGCAGTTT CCGCCCTGCA GCGGAGATAG CCACCCTGCG CGCCTACATG CGCCAGCGCG AACGACTGAC AGAATATGCT GCCGCGCATA TCCAGCATAT GCAGAAGGCG CTGATGGAGA TGAACCTGCA ACTGCATCAT GTCGTCTCCG ACATCACTGG CGCAACGGGC ATGCGGACTA TCCGCGCCAT TGTCGATGGC CAGCGCGATC CTGAAGTTCT CGCCGCATTC CGGGATATCC GCTGCCATTC ATCGCTGGAC ACGATCAAAG CCGCGCTGGT CGGTAACGAC CGCGAAGAAC ATGTCTTTGC CTTGACACAG TCATTGGAGC TTTACGACTT CTACAAAGCG CAGATCGAGG CCTGCGACCG CAGGCTGGAA GCTGCGGTGG GGGCGCTGAC AGTCCGGGCA GGTGATGGTG TGGCCCCACT GCCCAAGGCG CGGATCAAGG GCACGCAACA CAATGCGCCG TCCTTCGATG TGCGAGCTGC GCTCTATGGC GTGTTAGGCA CGGACCTGAC ACAGATCCAT GGCCTTGGGC CATCGCTGGC GCTGAAGCTG GTGGCCGAAT GCGGCACGGA TTTGCGGGCC TGGAAGAGCG CTAAGCACTT CACGTCGTGG CTCTGTCTGG CACCGGGCAA CAAAATCTCT GGTGGCAAAC TGCTGTCCTC CCGGACACGC CGATCCTCCA GCCGCGCCGC CGCGCTTCTG CGTCTGGCGG CAGTCACAAT AGGCCGAAGC GACACGGCCT TGGGCGCGTT CTATCGACGG CTGTCATCGC GCATAGGCAA ACAAAAGGCG GTCACAGCGA CAGCGCGCAA GATTGCCGTT TTGTTCTACA ACGCCATTCG CCATGGCATG ACCTATCAAG ATCAGGGCGC GGCGGCTTAT GATGAGCGAC ACCGACAACG CGTGCTCTCA AACCTCCATC GCCGCGCCAA GACCCTCGGA TTTGCATTGG CGCCTATTCC CGAGACAGAG GCTGTTTCTT AG
|
Protein sequence | MTKRKTGGHP DLNVVNPNAA AIDIGSTMHM AAVNPDAAAM PVRAFGTFTQ DLHDLADWLR SCGVTSVAME STGVYWIPAF EILEAHGFEV ILVNARYAKN VPGRKTDVSD AGWLRQLHSY GLLRSSFRPA AEIATLRAYM RQRERLTEYA AAHIQHMQKA LMEMNLQLHH VVSDITGATG MRTIRAIVDG QRDPEVLAAF RDIRCHSSLD TIKAALVGND REEHVFALTQ SLELYDFYKA QIEACDRRLE AAVGALTVRA GDGVAPLPKA RIKGTQHNAP SFDVRAALYG VLGTDLTQIH GLGPSLALKL VAECGTDLRA WKSAKHFTSW LCLAPGNKIS GGKLLSSRTR RSSSRAAALL RLAAVTIGRS DTALGAFYRR LSSRIGKQKA VTATARKIAV LFYNAIRHGM TYQDQGAAAY DERHRQRVLS NLHRRAKTLG FALAPIPETE AVS
|
| |