Gene Sare_2912 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2912 
Symbol 
ID5703730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3292897 
End bp3294144 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content69% 
IMG OID641272361 
Productputative oxygenase subunit protein 
Protein accessionYP_001537729 
Protein GI159038476 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.69464 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAGAA TACTGATCGT CGGTGCCGGG CAGTCCGGCC TGCAACTCGC CCACGGGCTA 
CTCGCCGAGG GCTACGAGGT GACGATCATG TCCGCTCGCA CCCCGGACGA GATCCGTAAC
GGCTGGCCAA CCTCCACCCA GGCCATGTTC GCTCCCGCGC TGGACACCGA ACGCCGCTAC
GAGCTGAACC TGTGGGACGA CCAGGCGCCC CCGATCGCGG GCCTGCGCGT CAACCTGTCC
GCACCGCCGG GTACCCGCGC GCTGGACGTC ATCGCCGAGT TGGAGCGACC AGCCCAGTCC
ACCGACCAAC GGCTGAAACT GGCTGCCTGG CTCGAACTCG CCGAGCACCG CGGCGCCACT
GTCGTCCACA ACACGGCCTC CGCCGCCGAC CTGGACACCC TCACCACGCA CGGCCACTAC
GACCTGACCA TCGTCGCCGC CGGCAGAAGC GACCTCGCCG CCGCGTTCGA CCGAGACCCA
GCCCGTTCAC CCCACACAAC CCCACAACGC GGCCTGGCCA TCGCCTACGT CCACGGCCTC
GCTCCCGACC CGGACTGGCC CGCCCCGCAC GTCGGATTCC ACGCCGTGCC TGGCCTCGGC
GAGTTGTTCG TGATTCCTGC CCTCACGCAC GCCGGCGCCT GCGACATCCT CTTCTGGGAA
GCTGTCCCAG GTGAGGACCT GGACCGCTGG CCCGCCAACG GTAGCCGTGT GCCACCCACC
GAGCACCTTC AGACCACCCT CGACCTCGCC AAGCAGTACG TGCCCTGGGT GTATGAGCGA
TGCCGGAACG TCGAACTGAC CGACGCCAAG GCCACCCTGC ACGGTCGGTT CACCCCCACC
GTACGCACAC CGATCGCCCA CCTACCCGGC GGCGGCGTCG CACTGGGCAT GGCTGATGTG
GTCGTCACCA ACGACCCCAT CACCGGCCAG GGCGCCAACA CCGCTGCCAA ATGCGCCGAC
CACTACCTAC GCGCCATCCT CGCCCACGCC GACCGACCCT TCGACACCAC CTGGATGCGC
GACACCTTCG AGGCGTTCTG GACCACCACC GCTCGCGCGG TCACTGCCTG GACCAACGCC
ATGCTGCAAC CCCTACCAGA GCATGTGCAG CAGATCCTCG CCACCGCCGC CACCAACCAG
GCAGTCGCCC AGCGATTCGC CGCCGGGTTC GCCGATCCGA GCACTCTCAC CGACTGGTTG
ATGACCCCCA CCGGCGCAGC CGACTACCTC GCCTCCATCC GTACCTGA
 
Protein sequence
MRRILIVGAG QSGLQLAHGL LAEGYEVTIM SARTPDEIRN GWPTSTQAMF APALDTERRY 
ELNLWDDQAP PIAGLRVNLS APPGTRALDV IAELERPAQS TDQRLKLAAW LELAEHRGAT
VVHNTASAAD LDTLTTHGHY DLTIVAAGRS DLAAAFDRDP ARSPHTTPQR GLAIAYVHGL
APDPDWPAPH VGFHAVPGLG ELFVIPALTH AGACDILFWE AVPGEDLDRW PANGSRVPPT
EHLQTTLDLA KQYVPWVYER CRNVELTDAK ATLHGRFTPT VRTPIAHLPG GGVALGMADV
VVTNDPITGQ GANTAAKCAD HYLRAILAHA DRPFDTTWMR DTFEAFWTTT ARAVTAWTNA
MLQPLPEHVQ QILATAATNQ AVAQRFAAGF ADPSTLTDWL MTPTGAADYL ASIRT