Gene Sros_3854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3854 
Symbol 
ID8667144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4291582 
End bp4293222 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content70% 
IMG OID 
Productcholine dehydrogenase 
Protein accessionYP_003339515 
Protein GI271965319 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.131347 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00687272 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTACGACT TCGTCATCGT TGGGGGCGGA TCGGCCGGAA GCGCTCTGGC GAACCGGTTG 
TCCGCCGACC CCGCCAACCG GGTGCTGGTG CTGGAGGCCG GCCGCCCGGA CTACCCCTGG
GACGTCTTCA TCCACATGCC CGCCGCGCTG ACCTTCCCCA TCGGCAGCCG GTTCTACGAC
TGGAAGTACG AGTCCGAGCC CGAGCCGCAC ATGAACGGCC GCCGGATCTA CCACGCCCGC
GGCAAGGTCC TGGGCGGTTC CAGCAGCATC AACGGCATGA TCTTCCAGCG GGGCAATCCC
CTGGACTACG AGCGCTGGGC CGCCGACCCG GGCATGGAGA CCTGGGACTT CGCCCACTGC
CTGCCCTATT TCAAGCGGAT GGAGAACTGC CTCGCCGCGG ACCCGGACGA TCCCCTGCGA
GGACACGACG GGCCTCTGGT GCTTGAGCGC GGGCCGGTGC GCAACCCGCT GTTCGCGGCG
TTCTTCGAGG CGGCGCAGCA GGCCGGCTAC CCGCTGACCG ACGACGTCAA CGGATACCGC
CAGGAGGGCT TCGCCCGTTT CGACCGCACC ATCCGCCGGG GGCGCCGGCT CTCCGCCGCG
CGGGCCTACC TGCATCCCGT CAGGAGACGC CCCAACCTGG AGATCAGGAC CCGGGCGTTC
GTCACGAGGA TCCTCTTCGA GGGGGGACGC GCCGTCGGCG TCGAGTACAA CGGCCGGACG
GTCCGCGCGG GCGAGGTCGT CCTCTGCGGC GGCGCGATCA ACTCCCCGCA GCTGCTCCAG
CTCTCCGGTG TGGGCGACGC CGCCGAACTG GGCGCCCTCG GCGTCGACGT CGTGCACGAC
CTGCCGGGGG TGGGGGAGAA CCTGCAGGAC CATCTGGAGG TCTACATCCA GTACGGCTGC
AGGCGGCCGG TGTCGATGCA GCCCGCGATG AAGTGGCGCA ACCGGCCGTG GATAGGCGCG
CAATGGCTGT TCCTGCGCAG CGGGCCCGGA GCGACCAACC ACTTCGAGGC GGGCGGTTTC
GTTCGCGGCA ACGACGACGT CGACTACCCC AACCTGATGT TCCACTTCCT GCCCGTCGCC
GTCCGCTACG ACGGGTCCGC GCCCGTCGGC GGGCACGGCT ACCAGGTGCA CATCGGGCCG
ATGTACTCCG ACGCGCGCGG CTCGGTGAAG ATCAGGAGCA CCGATCCCCG GGTCCATCCG
GCGCTGCGGT TCAACTACCT GTCCACCGCG CGGGACCGGC GGGAGTGGGT GGAGGCGGTC
CGGGTCGCCC GCGACGTCCT GACCCAGCGG GCGATGGACG AGTTCAACGC GGGGGAGCTG
TCGCCCGGAC CGGAGGTCCG GACCGACCAG GAGATCCTGG ACTGGGTGGC CAAGGACGGC
GAGACCGCGC TGCACCCCTC CTGCACCGCC CGGATGGGCG TCGACGACCT CGCCGTCGTC
GATCCCCTCT CCATGAGGGT CCACGGCCTC GACGGGCTCC GCGTCGTGGA CGCCTCGGTC
ATGCCGTACG TGACCAACGG CAACATCTAC GCGCCGGTCA TGATGGTCGC GGAGAAGGCG
GCGGACCTCA TCCTGGGCGA CACGCCGATG GCGGCCGAAC CCGCCGGCTT CTACCGGCAC
CGCGGGGGCG CGGACGGCTG A
 
Protein sequence
MYDFVIVGGG SAGSALANRL SADPANRVLV LEAGRPDYPW DVFIHMPAAL TFPIGSRFYD 
WKYESEPEPH MNGRRIYHAR GKVLGGSSSI NGMIFQRGNP LDYERWAADP GMETWDFAHC
LPYFKRMENC LAADPDDPLR GHDGPLVLER GPVRNPLFAA FFEAAQQAGY PLTDDVNGYR
QEGFARFDRT IRRGRRLSAA RAYLHPVRRR PNLEIRTRAF VTRILFEGGR AVGVEYNGRT
VRAGEVVLCG GAINSPQLLQ LSGVGDAAEL GALGVDVVHD LPGVGENLQD HLEVYIQYGC
RRPVSMQPAM KWRNRPWIGA QWLFLRSGPG ATNHFEAGGF VRGNDDVDYP NLMFHFLPVA
VRYDGSAPVG GHGYQVHIGP MYSDARGSVK IRSTDPRVHP ALRFNYLSTA RDRREWVEAV
RVARDVLTQR AMDEFNAGEL SPGPEVRTDQ EILDWVAKDG ETALHPSCTA RMGVDDLAVV
DPLSMRVHGL DGLRVVDASV MPYVTNGNIY APVMMVAEKA ADLILGDTPM AAEPAGFYRH
RGGADG