Gene Sare_0201 details


Gene Information

Locus tag: Sare_0201
Symbol:
ID: 5706220
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 217771
End bp: 220080
Gene Length: 2310 bp
Protein Length: 769 aa
Translation table: 11
GC content: 70%
IMG OID: 641269727
Product: aldehyde oxidase and xanthine dehydrogenase molybdopterin binding
Protein accession: YP_001535127
Protein GI: 159035874
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID:
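
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; names are illustrative only.
start_bp = 217771
end_bp = 220080

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be the stop codon.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, codons, protein_length_aa)  # 2310 770 769
```

Under those assumptions the result matches the listed Gene Length (2310 bp) and Protein Length (769 aa).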


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00447833
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCGCCA TGGCCTGGAT CGGCTCCGAC CATGTCCGCC ACGACGGCAC CGAGAAGGTC 
CGCGGCGAAC CGATCTATGG TGCCGATCGC ACGGCCGAGG CGATGACGTA CGCGGTCCCG
GTCAGTGCGA CCGTGGGCCG GGGCCGGATC ACGGCGCTCG ACACTACCGC CGCGCAGCGG
GTGCCGGGCG TGCTCGCGGT GCTCACACAC GAGAACCTGG ACCGCCTGCA CCCGGCCGAC
TTCGCCTATG GCGTGGGGAG CGCGAGCGCG AGCTACCAGC CGATGCAGGA CGCCGTCGTG
GCCTACCGGG GTCAACCGAT CGCGCTTGTC GTGGCTGAGA CCCTGGAGGC CGCGGCGGAA
GCCGCCGGGT TGGTGACCGC CGGGTACGAG GTCGAACCGT TCGCCGTCAC ACTCGACGAT
CCGGCGGCCG AATCGGTCGA CCAGGCGCAG GCGGTGCCCA CGTTCCCGGT GCTGGAGATC
GGGGAGGGTG ATCGGCTACT CGACCAAGCG CCGGTCGTGG TCGACGCCAC CTACGGCACG
CCGGCGCAGC ACCACAACGG GCTGGAGCTG CTGTCCACCG TCGCCGAGTG GAAGGACGGT
TCACTTCTCA TCCATGAGGG CACTCAGGCG GCGGGACGCG TACGGCACGC GCTGGCGAAT
CAGCTCGGGA TCCCGATGGA AATGGTCCGG GCAGTTGCGC CATACCTCGG TGGCGGCTTC
GGCCAGCGCA CCGGTCAGAC GTTCAACACG GTGCTCGCTG CTCTCGCCGC GCGGCGGATC
GGTCGGCCGG TGAAGCTGAT CGTCCCGCGG GCCGACGTGT TCCACATGGT GCACTTTCGT
CCGGCGTCCC GGCACCGCAT CCGGCTCGGT GCACGGGACG ACGGCACGAT CACCTCCCTG
GTGCACGACG CGCATGCGCA AACGTCACGC CACGATCTGA TGCCGTTCTG GGGGCCGGAA
GTCTCGTCCC GCATGTACGG CATCCCGAAC TTTCGCGGCA CGACCACGCT GGTGCGTCTC
GACACCCAGA CGCCCGGCTA CATGCGGGCG CCGATGGAGA TGGTGACGAT GTTCGCGGTG
GAGAGCGCGC TGGACGAACT CGCCGAGCGG CTACACGTCG ACCCGGTCGA GCTACGCCGG
CGCCACGACA CCGCCACCGA CCCGCTGACC GGCAGACCGT TCTCCTCGCG GCGACTCAAG
CAGTGCCTCG ATCGGGGTGC CGAGCGGTTC GGGTGGTCCA GACGCGATCC GGCGGCCGGG
TCGATGCGCG CCGACGACGG CAGTCTCGTC GGCTGGGGTA TGGCGGCAGG CTGCTATCCC
GGCATCGCCT CCGCCGCCGG CTCCCGGATA CGGCTGCACG AAAACGGCAC GGCGGACGTC
GCGGTCAGCG GACACGAGAT GGGCCAGGGG ATTCGCACCG TGATCGCGCT GGTCGCGGCC
GAATCTCTCG GCCTGCCGCC AGATCGAATC CGCATCACCA TCGGAGACAC CCGGGTCGCT
CCTCAGCCAG AAACCGGCGG TTCGTGGGGA ACCGCCACCG CTGTGCCCGC GGTCCGGGAC
GCGGCCAACG ACATCCGGGC CCAACTGCAT CAGATCGCCG CCGCCCGTGG CGAGTCCGTC
GCCACCGTCG ACGTCACCGA GTGCCGACTG GCAGACGGCA GGCTGGTCGG CCCGGACAAT
TCGGGGCCAC TGATGACCGG GCTCCTGATG GCCGCCGGCC GCTCCTCGGT TGAGGCGACG
GGGCAGTACT ACGCCCCGGG GCAGCAGCCG TCCGAGGCCC CGACCCTGGC TCCGGCGCGG
AAGAGCGCGG TGATGGCGGA CGTCGGCAGT GCCTTCGTGG GCCCGGCGTT CCCCGGTTTC
GTCACCTGGT CCTACATCGC CCACTTCGTC GAGGTCCGCG TCGGAGCCCG GGTCCGCCGA
CCGCGGGTCA CCCGGATGCT GTCGGTGATC GACTGCGGCC GGGTGATCAG TCGGCGTACC
GCCACCAGCC AGGCACTGGG CGGCCTGGTC TGGGGCATCA GCACCGCGCT CAGTGAGGAG
AGCATCGTGG ACCCCCGCTA CGGCGGGTTT CTCAACTCCA ACCTGGGCGA CTACAAGATG
CCGGTGAACG CCGACATACC CACGCTCGAC GTCGACTTCA TCGACGAGCC CGACCCCTCG
TTCAGCGCGT TCGGTATCAA GGGCCTGGGG GAGGTCGTCC ATGTCGGGGC AGCAGCCGCG
ATCACCAACG CCATCTACCA TGCAACCGGC GTCCGGGTCC GGGATCTGCC CGTACACATC
GAGGACCTGA TGACGGAGAC CTCCCGATGA
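
The GC content listed in the record can be recomputed from the gene sequence. The following is a minimal sketch, not the method the database itself uses: `clean` and `gc_percent` are hypothetical helper names, and the sequence is assumed to be plain text in space-separated blocks of ten bases as shown above. Only the first line of the sequence is used here for brevity.

```python
def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First line of the gene sequence from the record (60 bases).
first_line = "ATGAGCGCCA TGGCCTGGAT CGGCTCCGAC CATGTCCGCC ACGACGGCAC CGAGAAGGTC"
print(len(clean(first_line)), round(gc_percent(first_line), 1))
```

Passing the full 2310 bp sequence instead of the first line gives the figure to compare against the 70% reported in the Gene Information section.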
 
Protein sequence
MSAMAWIGSD HVRHDGTEKV RGEPIYGADR TAEAMTYAVP VSATVGRGRI TALDTTAAQR 
VPGVLAVLTH ENLDRLHPAD FAYGVGSASA SYQPMQDAVV AYRGQPIALV VAETLEAAAE
AAGLVTAGYE VEPFAVTLDD PAAESVDQAQ AVPTFPVLEI GEGDRLLDQA PVVVDATYGT
PAQHHNGLEL LSTVAEWKDG SLLIHEGTQA AGRVRHALAN QLGIPMEMVR AVAPYLGGGF
GQRTGQTFNT VLAALAARRI GRPVKLIVPR ADVFHMVHFR PASRHRIRLG ARDDGTITSL
VHDAHAQTSR HDLMPFWGPE VSSRMYGIPN FRGTTTLVRL DTQTPGYMRA PMEMVTMFAV
ESALDELAER LHVDPVELRR RHDTATDPLT GRPFSSRRLK QCLDRGAERF GWSRRDPAAG
SMRADDGSLV GWGMAAGCYP GIASAAGSRI RLHENGTADV AVSGHEMGQG IRTVIALVAA
ESLGLPPDRI RITIGDTRVA PQPETGGSWG TATAVPAVRD AANDIRAQLH QIAAARGESV
ATVDVTECRL ADGRLVGPDN SGPLMTGLLM AAGRSSVEAT GQYYAPGQQP SEAPTLAPAR
KSAVMADVGS AFVGPAFPGF VTWSYIAHFV EVRVGARVRR PRVTRMLSVI DCGRVISRRT
ATSQALGGLV WGISTALSEE SIVDPRYGGF LNSNLGDYKM PVNADIPTLD VDFIDEPDPS
FSAFGIKGLG EVVHVGAAAA ITNAIYHATG VRVRDLPVHI EDLMTETSR
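
The protein sequence can be cross-checked against the gene sequence by translating codons. The sketch below is a hedged illustration: it builds the standard codon-to-amino-acid mapping, which translation table 11 shares with the standard code for elongation (table 11 differs in its permitted start codons, which this sketch does not model). The `translate` function is a hypothetical helper, and only the first and last 30 bases of the gene are translated.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence codon by codon, keeping '*' for stops."""
    bases = "".join(seq.split()).upper()
    return "".join(
        CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases) - 2, 3)
    )


# First and last 30 bases of the gene sequence from the record.
head = "ATGAGCGCCA TGGCCTGGAT CGGCTCCGAC"
tail = "GAGGACCTGA TGACGGAGAC CTCCCGATGA"

print(translate(head))  # MSAMAWIGSD
print(translate(tail))  # EDLMTETSR*
```

Both fragments agree with the start (MSAMAWIGSD) and end (EDLMTETSR) of the protein sequence above, with the trailing `*` corresponding to the TGA stop codon.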