Gene Information |
Locus tag | Sros_3854 |
Symbol | |
ID | 8667144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 4291582 |
End bp | 4293222 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | choline dehydrogenase |
Protein accession | YP_003339515 |
Protein GI | 271965319 |
COG category | |
COG ID | |
TIGRFAM ID | |
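The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and introduced only for illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 4291582
end_bp = 4293222
gene_length_bp = 1641
protein_length_aa = 546


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Compare the computed values with the values listed in the table.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions the listed gene length and protein length agree with the listed coordinates.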
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.131347 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00687272 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGTACGACT TCGTCATCGT TGGGGGCGGA TCGGCCGGAA GCGCTCTGGC GAACCGGTTG TCCGCCGACC CCGCCAACCG GGTGCTGGTG CTGGAGGCCG GCCGCCCGGA CTACCCCTGG GACGTCTTCA TCCACATGCC CGCCGCGCTG ACCTTCCCCA TCGGCAGCCG GTTCTACGAC TGGAAGTACG AGTCCGAGCC CGAGCCGCAC ATGAACGGCC GCCGGATCTA CCACGCCCGC GGCAAGGTCC TGGGCGGTTC CAGCAGCATC AACGGCATGA TCTTCCAGCG GGGCAATCCC CTGGACTACG AGCGCTGGGC CGCCGACCCG GGCATGGAGA CCTGGGACTT CGCCCACTGC CTGCCCTATT TCAAGCGGAT GGAGAACTGC CTCGCCGCGG ACCCGGACGA TCCCCTGCGA GGACACGACG GGCCTCTGGT GCTTGAGCGC GGGCCGGTGC GCAACCCGCT GTTCGCGGCG TTCTTCGAGG CGGCGCAGCA GGCCGGCTAC CCGCTGACCG ACGACGTCAA CGGATACCGC CAGGAGGGCT TCGCCCGTTT CGACCGCACC ATCCGCCGGG GGCGCCGGCT CTCCGCCGCG CGGGCCTACC TGCATCCCGT CAGGAGACGC CCCAACCTGG AGATCAGGAC CCGGGCGTTC GTCACGAGGA TCCTCTTCGA GGGGGGACGC GCCGTCGGCG TCGAGTACAA CGGCCGGACG GTCCGCGCGG GCGAGGTCGT CCTCTGCGGC GGCGCGATCA ACTCCCCGCA GCTGCTCCAG CTCTCCGGTG TGGGCGACGC CGCCGAACTG GGCGCCCTCG GCGTCGACGT CGTGCACGAC CTGCCGGGGG TGGGGGAGAA CCTGCAGGAC CATCTGGAGG TCTACATCCA GTACGGCTGC AGGCGGCCGG TGTCGATGCA GCCCGCGATG AAGTGGCGCA ACCGGCCGTG GATAGGCGCG CAATGGCTGT TCCTGCGCAG CGGGCCCGGA GCGACCAACC ACTTCGAGGC GGGCGGTTTC GTTCGCGGCA ACGACGACGT CGACTACCCC AACCTGATGT TCCACTTCCT GCCCGTCGCC GTCCGCTACG ACGGGTCCGC GCCCGTCGGC GGGCACGGCT ACCAGGTGCA CATCGGGCCG ATGTACTCCG ACGCGCGCGG CTCGGTGAAG ATCAGGAGCA CCGATCCCCG GGTCCATCCG GCGCTGCGGT TCAACTACCT GTCCACCGCG CGGGACCGGC GGGAGTGGGT GGAGGCGGTC CGGGTCGCCC GCGACGTCCT GACCCAGCGG GCGATGGACG AGTTCAACGC GGGGGAGCTG TCGCCCGGAC CGGAGGTCCG GACCGACCAG GAGATCCTGG ACTGGGTGGC CAAGGACGGC GAGACCGCGC TGCACCCCTC CTGCACCGCC CGGATGGGCG TCGACGACCT CGCCGTCGTC GATCCCCTCT CCATGAGGGT CCACGGCCTC GACGGGCTCC GCGTCGTGGA CGCCTCGGTC ATGCCGTACG TGACCAACGG CAACATCTAC GCGCCGGTCA TGATGGTCGC GGAGAAGGCG GCGGACCTCA TCCTGGGCGA CACGCCGATG GCGGCCGAAC CCGCCGGCTT CTACCGGCAC CGCGGGGGCG CGGACGGCTG A
Protein sequence | MYDFVIVGGG SAGSALANRL SADPANRVLV LEAGRPDYPW DVFIHMPAAL TFPIGSRFYD WKYESEPEPH MNGRRIYHAR GKVLGGSSSI NGMIFQRGNP LDYERWAADP GMETWDFAHC LPYFKRMENC LAADPDDPLR GHDGPLVLER GPVRNPLFAA FFEAAQQAGY PLTDDVNGYR QEGFARFDRT IRRGRRLSAA RAYLHPVRRR PNLEIRTRAF VTRILFEGGR AVGVEYNGRT VRAGEVVLCG GAINSPQLLQ LSGVGDAAEL GALGVDVVHD LPGVGENLQD HLEVYIQYGC RRPVSMQPAM KWRNRPWIGA QWLFLRSGPG ATNHFEAGGF VRGNDDVDYP NLMFHFLPVA VRYDGSAPVG GHGYQVHIGP MYSDARGSVK IRSTDPRVHP ALRFNYLSTA RDRREWVEAV RVARDVLTQR AMDEFNAGEL SPGPEVRTDQ EILDWVAKDG ETALHPSCTA RMGVDDLAVV DPLSMRVHGL DGLRVVDASV MPYVTNGNIY APVMMVAEKA ADLILGDTPM AAEPAGFYRH RGGADG
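The sequences above are displayed in space-separated blocks of ten characters. As a rough sketch of how such a string could be prepared for downstream use, the helpers below strip the grouping and compute a GC fraction. The function names `ungroup` and `gc_fraction` are hypothetical, and only the first two blocks of the gene sequence are used as sample input.

```python
def ungroup(blocks: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase the result."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence shown above, used only as a short sample.
sample = ungroup("ATGTACGACT TCGTCATCGT")
print(len(sample), gc_fraction(sample))
```

Passing the full gene sequence through `ungroup` and `gc_fraction` would give a value that can be compared with the GC content listed in the table, which is presumably rounded to a whole percent.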