Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1948 |
Symbol | |
ID | 5712942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2038268 |
End bp | 2039287 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641267873 |
Product | putative basic membrane protein |
Protein accession | YP_001533290 |
Protein GI | 159044496 |
COG category | [R] General function prediction only |
COG ID | [COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00107991 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0000839366 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCACAGC CACCAGCCAG GACCGGCCTC AGCCGCCGCG CCTTCGTCTC CTCCGCCCTT GCGCTGGGCG CTGCGGGCGT CCTCGTCCGC CCGGCCGCCG CCGCCGACCC GATCAAGGTC GCGGGCGTCT ACACCGTCCC GGTGGAGCAG CAATGGGTCA GCCGCATCCA TATCGCCGCT GAAGCCGCCG CCGCCGCGGG CCAGATCACC TACACCTTCT CCGAAAACGT CGCCAATACC GACTACCCCC GCGTGATGCG CGAATACGCC GAGAGCGGGA TCGAGCTGAT GATCGGCGAG GTCTTCGCGG TCGAGGCCGA GGCCCGCGAG GTCGCCGCCG ACTACCCCGA GGTGGCCTTC CTGATGGGCT CCTCCTTCCT TGAGGACCCG AGCCTGCCCA ATTTCGCCGT GTTCGACAAC TACATCCAGG ACGCGGCCTA CCTGACCGGC CTGATCGCGG GGGCGATGTC CGAGGCGGGC AATATCGGCA TGGTCGGCGG CTTCCCGATC CCCGAGGTCA ACCGCCTGAT GCACGCCTTC ATGGCCGGCG CGCGCGAGAT CAACCCGGAC GTGACCTTCC AGGTCAGCTT CATCGGGTCG TGGTTCGACC CGCCCAAGGC CAAGGAAACC GCCTTCGCCA TGATCGAGAA CGGCGCCGAC CTTCTCTATG CCGAACGCTT CGGGGTGTCG GACGCGGCGC AGGAACGGGG CCTTCTGGCC ATCGGCAACG TGATCGACAC CCAGGCGGAT TATCCCGACA CCGTGGTCGC CTCGGCCCTG TGGCATTTCG AGCCGACCCT GCAGGCCGCC ATCGCGGCGG TCAACGCGGG CGAATTCGAG GCGGCGAATT ACGGGGTCTT TTCCTACATG CGCGAAGGCG GCAGCAGCCT CGCGCCGCTG GGCACCTTCG AGGACAAGGT CCCGGCCGAG ATCAAGACCC TGGTGCAGGA ACGCCAGGAC GCCATCAAGG CCGGCACCTT CACCGTCGAG ATCAACGACG AAGAGCCGAC CTCCTCCTGA
|
Protein sequence | MSQPPARTGL SRRAFVSSAL ALGAAGVLVR PAAAADPIKV AGVYTVPVEQ QWVSRIHIAA EAAAAAGQIT YTFSENVANT DYPRVMREYA ESGIELMIGE VFAVEAEARE VAADYPEVAF LMGSSFLEDP SLPNFAVFDN YIQDAAYLTG LIAGAMSEAG NIGMVGGFPI PEVNRLMHAF MAGAREINPD VTFQVSFIGS WFDPPKAKET AFAMIENGAD LLYAERFGVS DAAQERGLLA IGNVIDTQAD YPDTVVASAL WHFEPTLQAA IAAVNAGEFE AANYGVFSYM REGGSSLAPL GTFEDKVPAE IKTLVQERQD AIKAGTFTVE INDEEPTSS
|
| |