Gene Sare_0202 details


Gene Information

Locus tag: Sare_0202
Symbol:
ID: 5706221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 220077
End bp: 221075
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 68%
IMG OID: 641269728
Product: molybdopterin dehydrogenase FAD-binding
Protein accession: YP_001535128
Protein GI: 159035875
COG category: [C] Energy production and conversion
COG ID: [COG1319] Aerobic-type carbon monoxide dehydrogenase, middle subunit CoxM/CutM homologs
TIGRFAM ID:
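
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
START_BP = 220077
END_BP = 221075


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 999
    print(protein_length(length))  # 332
```

Under these assumptions both values agree with the Gene Length and Protein Length fields listed in the record.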


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.955618
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00748025
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGGACCT TCACCTACAC CCGGGTGCGC TCCGTACCAG AGGCCGTGAC GGCGTTCGCC 
ACCTCCACCG ATGACCGGCC GCACTACCTC AGCGGTGGCA CGACCCTGGT AGACCTGATG
AAACTGGACA TCGAAGCACC CGGCCAGGTC ATCGACCTGA CTGACGTCAC CGACCTCGAC
TTCGTCCGCG AGGAAGACGG TGACCTGGTC ATCGGCGCGT TGACCCGGAT GAGTGACGTC
GCGAACCACC CGCTGGTCGG TGCCAGATGT CCGGCGCTCG CGGATTCACT GCTCTCCGGG
GCTTCGCAAC AGCTTCGGAA CATGGCGCGA GTTGGCGGCA ACCTGCTGCA GCGCACCCGC
TGCGACTACT TCCGATCGGT CGAGTTTCCC TGCAACAAGC GGCGGCCTGG GTCGGGATGT
GCCGCGATCG GCGGCGTCAA CCGCCAGCAC GCCATCCTCG GTACCAGCGA ACACTGCATC
GCCGTGTACC CGGGTGACTG GGCGGTGGCG CTGACCGCGT TCGACGCGAA CCTCGCGGTC
GTCGGACCGA GTGGCACCCG CTCCATACCG ATCCATGACC TGATCGTGCC GCCCGGCGAC
ACACCACACC GGGAGACAAC ACTCACGCCC GGCGAGTTCA TCACGACCAT CCGGGTGCCG
ATGACACCGA CGGCACGGTC GTCCAGCTAT CGCAAGGTCG GCGACCGGAG TTCGTACTCG
TTCGCGCTGG CGTCGGCCGC CGTCGGGCTC CACCTCGACG CGGGCGGCAC CGTCGACGAG
GTACGGATCG CCCTCGGTGG CCTGGGCACC GTGCCCTGGC GACTGTGGGA CGCCGAACGA
GCACTGACCG GTGGCCGGCT GGACGACGCA ACCGTACGGG CGGCGCTGGA GCCGGAGTTC
CGGTCGGCAT GCACCACGAG GCAGAACGCC TTCCGGGTGC GGTTGGGTGT GGAAACCGTC
CTTGAGGCTG TCGCCTCGGC GCAGGAGAGG GTGGGATGA
 
Protein sequence
MRTFTYTRVR SVPEAVTAFA TSTDDRPHYL SGGTTLVDLM KLDIEAPGQV IDLTDVTDLD 
FVREEDGDLV IGALTRMSDV ANHPLVGARC PALADSLLSG ASQQLRNMAR VGGNLLQRTR
CDYFRSVEFP CNKRRPGSGC AAIGGVNRQH AILGTSEHCI AVYPGDWAVA LTAFDANLAV
VGPSGTRSIP IHDLIVPPGD TPHRETTLTP GEFITTIRVP MTPTARSSSY RKVGDRSSYS
FALASAAVGL HLDAGGTVDE VRIALGGLGT VPWRLWDAER ALTGGRLDDA TVRAALEPEF
RSACTTRQNA FRVRLGVETV LEAVASAQER VG
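
The sequences above are printed in blocks of 10 characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup plus a simple GC fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCGGACCT TCACCTACAC CCGGGTGCGC TCCGTACCAG AGGCCGTGAC GGCGTTCGCC"
seq = clean_sequence(first_line)

print(len(seq))                        # 60
print(round(gc_fraction(seq) * 100))   # GC percentage of this line only
```

Passing the full gene sequence through the same two functions gives the length and GC percentage to compare against the Gene Length and GC content fields.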