Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1942 |
Symbol | |
ID | 5712936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2031291 |
End bp | 2032652 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641267867 |
Product | putative GSCFA family protein |
Protein accession | YP_001533284 |
Protein GI | 159044490 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.562926 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0000963078 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACATCCA ACCCGATCGA GACCGGCCCC GCACCAGAGG TGCTGCGCCG GGCGTCCCGC AACCCGCTGC GCCGCTACCC CACGCCCGAC GCCGGGGGGG ACCGGCTCTA CCCGCTGGCC ATGCCCGCGC CGACCCCGTC CTTCGAATTC AGTTCAAAAG AAACCGTCTT CGCCCTCGGC TCCTGCTTTG CCCGCAACAT CGAGGACGCG CTGGCCGCCG AAGGCTTCCG CGTGCTCAGC CGCGAGTTCG ACCTGGGCGA GATCGGCGCG AGCCTCGATG ATGCCGCCAA TTTCTTCAAC AAGTATACCA TCCACTCCAT GCTGAACGAG CTGCGCTGGG CCTTCGACCG CGACAGCTTC CCCGGCGCGG ACATGCTCTA CCCCCTGCGC GACGACGACA CCGCCTACCG CGACCTGCAA CTGGGCTCGG CCAAGCTCGC CTTCCCGCGC GACCGCATCC TCGCCTTCCG CCACCGGTTT CTGGATGCCG TGGCCCAGAT CGCCGAGGCC GACGTGGTGG TGATGACCCT GGGCTATGTC GAAGGCTGGC GCGACACCCG GCTCGACCTG GCGCTCAACA CCGCGCCCCC TCCGGCACTC TGCGCCCGCG AGCCCGACCG CTTCGCCTTC GAGGTGCTGA GCTACGAGGA TGTGCTGGGC GGACTGCGCG CCTTCCACGC GCTGCTGACC GCCCACCGCA CCAAGCCGCT CAAGATGCTG CTGACCGTCT CGCCCGTCCC GCTCCTGAGC ACCTTCCGCG ACATGGACGT GCTGGTCGCC AACTCCTACT CCAAGGCCGT GCAACGCGCC GCGGTCGAAA CCTTCGTCGC CGAAACCCCC GGCGTCGACT ACTTCCCGTC CTACGAATGC GTCACCCTGA GCGACCCGGC CGCGATCTGG ACCGAGGGCG ACTTCCGCCA CGTCGCCCCC GATCTGGTCA CCCGCATCAT GTCCAGCGTC CTGACCGCCT ATGTCCCCGG CTGGGGCGAT AAGGGCGCGC TTACCCGCGC GGCGACCCGC GCCACCACGC GGCTTCTGCT CGGCGCCGGA CGCCATGACG AGCTTCTGGC GCTGCTCGAC GCCCACGGCC CCACGGACGA TGCCGAGCTG ACCGCCGCCC ACGCCCTCGC CCTGCGCCGC ACCGACCGCA CCGCGCAAGC CGTGGCCCTG ATGTGCGAGG TGGTCGAACG CACCCCCGAC GACCCCCAGC CCCTCGAACG GGTGATCCGC TGGTGCGAAC AACTCGACCG CATGGCCGTG GCCCGCGACT ACCTCGACCT GCACGCCCAA CGCTTCCCCA AGCGCCGCAA GTTCCGCCGG GGCCGCAAGT GCCGCAAGGC CGCCAACCGG GGCCGCGGCT GA
|
Protein sequence | MTSNPIETGP APEVLRRASR NPLRRYPTPD AGGDRLYPLA MPAPTPSFEF SSKETVFALG SCFARNIEDA LAAEGFRVLS REFDLGEIGA SLDDAANFFN KYTIHSMLNE LRWAFDRDSF PGADMLYPLR DDDTAYRDLQ LGSAKLAFPR DRILAFRHRF LDAVAQIAEA DVVVMTLGYV EGWRDTRLDL ALNTAPPPAL CAREPDRFAF EVLSYEDVLG GLRAFHALLT AHRTKPLKML LTVSPVPLLS TFRDMDVLVA NSYSKAVQRA AVETFVAETP GVDYFPSYEC VTLSDPAAIW TEGDFRHVAP DLVTRIMSSV LTAYVPGWGD KGALTRAATR ATTRLLLGAG RHDELLALLD AHGPTDDAEL TAAHALALRR TDRTAQAVAL MCEVVERTPD DPQPLERVIR WCEQLDRMAV ARDYLDLHAQ RFPKRRKFRR GRKCRKAANR GRG
|
| |