Gene Sare_2810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2810 
Symbol 
ID5707002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3191532 
End bp3193163 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content69% 
IMG OID641272266 
Productcholesterol oxidase 
Protein accessionYP_001537636 
Protein GI159038383 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00131697 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTGGTA CGAGCTTTTC CCGACGTGGC CTGCTACGGG CTACCACGCT CGGTGCCGGC 
GCCGCCGTTG CCGGTGCCGC GTTAGCCCAG CGGGCAGCCG CGGCACAGGG TGTCGTGCCC
GCGCTGGCCG GCAAGACCGC CGTGGTCGTC GGCAGCGGCT TCGGCGGGGC GGTCGCCGCG
TACCGACTCG GCCAGGCCGG TGTGATCACC ACCGTGCTGG AACGCGGTCT CCGCTGGGAC
GTCGACGGCT CGGGCAACAC GTTCTGTGGC ATCAACGAGC CGGACTGGCG GTGCGGCTGG
TTCCTGGACC GTCCGCCGCT GGGCATCAAC CTCGGCGCAA GGATCGAACG TCGTGCCGGC
CTGATCGCCC GCCACGAAGG CGACGGAATC AATGTCCTCA GCGGTGTGGG GGTCGGCGGC
GGCTCCCTCG CGATCGGGAT GTTCCTACCG CAGCCGCGGC GCAGCGAGTG GGAGCAGGTG
TACCCGGCCG ACGTCGGCTA CGACGAGATG AACACCATCT ACTGGCCGCG AGCCCGGCAA
CGCCTCGGGG CCTCGCCGAT ACCAGAGGAC GTGCAGAGCA CAGGTCCGTA CCGGGGGGCC
CGAGCCTGGC TGGAGTACCT GTCGGAGTTC GACCAGAATC CCCTGTCCAT CCCGTTCGCC
GTCGACTGGG ACGTGATCCG TGCCGAACTC GCGGGTGACG CGGTGGCATG CCACACGATC
GGCGAGGGCC CGTACGGCAG CAACTCCGGG GCGAAGAACA GCGTGGACCG CAACTACCTG
GCCTGGGCCG CCGCTACCGG CAACGTGACG ACACTTCCCC TCCACGAGGT CACCGAGATT
CATGAGGTGT CCGGCCAGGA CAGGTTCGAG GTCAGGTGCC GACAGATCGA CGTGTACGGC
ACGGTCCTCG CCACCAGGAC CTTCGCCTGC GACTACCTGT TCCTGGCCGC CGGCTCCGTC
TACACCACCT CCCTGCTGCT CACCTCCCAG GCCAAGGGCT GGCTCCCCCG CCTGGTCAAC
CCGGAGGTGG GCAAGGGCTG GGGCAACAAC GGCGACTTCC TGGTAACCCG GATCAACCTG
CGCAAGGACG TCGGCTACGC CCAGGGCGGT CCGGGCAACG TGAAGTACAT CGACGACGAC
AACCCGTTCG CCCCCACGTC GATGGCGTGG GAGGCAGCGC CCGTCCCCAA CTGGATGCCA
CGCACCACCG CGCACCTGGT GACCAGCATG GCACCCGAAC GTGGCGAGAT CCGCTACGAC
GCGACGACCG GAGCCGCCAA GGTGCACTGG CCGTACGGGG TGCTGCAGAC CACCGCTGAA
AAGGCGGCGG TCAACCTGGT GACCCGGCTG TGGTGGCAGA CCGAGGGCCG TAAGGGATAC
CTGCTCAACG GCCTACCGAC CTACGCCCGG GGGGTCGGCA CCGGGCTCGG CGCGGCGAAC
ACCTGGCACC CGCTGGGCGG CATGGTCATG GGCGGGGCCA CCGACTTCGG GGGTCGCTGC
GTCGACTATC CCAACCTCTT CTGCGTCGAC GGGTCGATCC TGCCGGGATC GGCCTGCCTC
GCGAATCCTG CGCTGACCAT CACCGCCAAC GCCGAGCGTT GCCTGGACAG GTTCGTCGCC
GCGCACACCT GA
 
Protein sequence
MTGTSFSRRG LLRATTLGAG AAVAGAALAQ RAAAAQGVVP ALAGKTAVVV GSGFGGAVAA 
YRLGQAGVIT TVLERGLRWD VDGSGNTFCG INEPDWRCGW FLDRPPLGIN LGARIERRAG
LIARHEGDGI NVLSGVGVGG GSLAIGMFLP QPRRSEWEQV YPADVGYDEM NTIYWPRARQ
RLGASPIPED VQSTGPYRGA RAWLEYLSEF DQNPLSIPFA VDWDVIRAEL AGDAVACHTI
GEGPYGSNSG AKNSVDRNYL AWAAATGNVT TLPLHEVTEI HEVSGQDRFE VRCRQIDVYG
TVLATRTFAC DYLFLAAGSV YTTSLLLTSQ AKGWLPRLVN PEVGKGWGNN GDFLVTRINL
RKDVGYAQGG PGNVKYIDDD NPFAPTSMAW EAAPVPNWMP RTTAHLVTSM APERGEIRYD
ATTGAAKVHW PYGVLQTTAE KAAVNLVTRL WWQTEGRKGY LLNGLPTYAR GVGTGLGAAN
TWHPLGGMVM GGATDFGGRC VDYPNLFCVD GSILPGSACL ANPALTITAN AERCLDRFVA
AHT