Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2810 |
Symbol | |
ID | 5707002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3191532 |
End bp | 3193163 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641272266 |
Product | cholesterol oxidase |
Protein accession | YP_001537636 |
Protein GI | 159038383 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00131697 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTGGTA CGAGCTTTTC CCGACGTGGC CTGCTACGGG CTACCACGCT CGGTGCCGGC GCCGCCGTTG CCGGTGCCGC GTTAGCCCAG CGGGCAGCCG CGGCACAGGG TGTCGTGCCC GCGCTGGCCG GCAAGACCGC CGTGGTCGTC GGCAGCGGCT TCGGCGGGGC GGTCGCCGCG TACCGACTCG GCCAGGCCGG TGTGATCACC ACCGTGCTGG AACGCGGTCT CCGCTGGGAC GTCGACGGCT CGGGCAACAC GTTCTGTGGC ATCAACGAGC CGGACTGGCG GTGCGGCTGG TTCCTGGACC GTCCGCCGCT GGGCATCAAC CTCGGCGCAA GGATCGAACG TCGTGCCGGC CTGATCGCCC GCCACGAAGG CGACGGAATC AATGTCCTCA GCGGTGTGGG GGTCGGCGGC GGCTCCCTCG CGATCGGGAT GTTCCTACCG CAGCCGCGGC GCAGCGAGTG GGAGCAGGTG TACCCGGCCG ACGTCGGCTA CGACGAGATG AACACCATCT ACTGGCCGCG AGCCCGGCAA CGCCTCGGGG CCTCGCCGAT ACCAGAGGAC GTGCAGAGCA CAGGTCCGTA CCGGGGGGCC CGAGCCTGGC TGGAGTACCT GTCGGAGTTC GACCAGAATC CCCTGTCCAT CCCGTTCGCC GTCGACTGGG ACGTGATCCG TGCCGAACTC GCGGGTGACG CGGTGGCATG CCACACGATC GGCGAGGGCC CGTACGGCAG CAACTCCGGG GCGAAGAACA GCGTGGACCG CAACTACCTG GCCTGGGCCG CCGCTACCGG CAACGTGACG ACACTTCCCC TCCACGAGGT CACCGAGATT CATGAGGTGT CCGGCCAGGA CAGGTTCGAG GTCAGGTGCC GACAGATCGA CGTGTACGGC ACGGTCCTCG CCACCAGGAC CTTCGCCTGC GACTACCTGT TCCTGGCCGC CGGCTCCGTC TACACCACCT CCCTGCTGCT CACCTCCCAG GCCAAGGGCT GGCTCCCCCG CCTGGTCAAC CCGGAGGTGG GCAAGGGCTG GGGCAACAAC GGCGACTTCC TGGTAACCCG GATCAACCTG CGCAAGGACG TCGGCTACGC CCAGGGCGGT CCGGGCAACG TGAAGTACAT CGACGACGAC AACCCGTTCG CCCCCACGTC GATGGCGTGG GAGGCAGCGC CCGTCCCCAA CTGGATGCCA CGCACCACCG CGCACCTGGT GACCAGCATG GCACCCGAAC GTGGCGAGAT CCGCTACGAC GCGACGACCG GAGCCGCCAA GGTGCACTGG CCGTACGGGG TGCTGCAGAC CACCGCTGAA AAGGCGGCGG TCAACCTGGT GACCCGGCTG TGGTGGCAGA CCGAGGGCCG TAAGGGATAC CTGCTCAACG GCCTACCGAC CTACGCCCGG GGGGTCGGCA CCGGGCTCGG CGCGGCGAAC ACCTGGCACC CGCTGGGCGG CATGGTCATG GGCGGGGCCA CCGACTTCGG GGGTCGCTGC GTCGACTATC CCAACCTCTT CTGCGTCGAC GGGTCGATCC TGCCGGGATC GGCCTGCCTC GCGAATCCTG CGCTGACCAT CACCGCCAAC GCCGAGCGTT GCCTGGACAG GTTCGTCGCC GCGCACACCT GA
|
Protein sequence | MTGTSFSRRG LLRATTLGAG AAVAGAALAQ RAAAAQGVVP ALAGKTAVVV GSGFGGAVAA YRLGQAGVIT TVLERGLRWD VDGSGNTFCG INEPDWRCGW FLDRPPLGIN LGARIERRAG LIARHEGDGI NVLSGVGVGG GSLAIGMFLP QPRRSEWEQV YPADVGYDEM NTIYWPRARQ RLGASPIPED VQSTGPYRGA RAWLEYLSEF DQNPLSIPFA VDWDVIRAEL AGDAVACHTI GEGPYGSNSG AKNSVDRNYL AWAAATGNVT TLPLHEVTEI HEVSGQDRFE VRCRQIDVYG TVLATRTFAC DYLFLAAGSV YTTSLLLTSQ AKGWLPRLVN PEVGKGWGNN GDFLVTRINL RKDVGYAQGG PGNVKYIDDD NPFAPTSMAW EAAPVPNWMP RTTAHLVTSM APERGEIRYD ATTGAAKVHW PYGVLQTTAE KAAVNLVTRL WWQTEGRKGY LLNGLPTYAR GVGTGLGAAN TWHPLGGMVM GGATDFGGRC VDYPNLFCVD GSILPGSACL ANPALTITAN AERCLDRFVA AHT
|
| |