Gene Sare_3035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3035 
Symbol 
ID5707237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3444038 
End bp3445237 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content71% 
IMG OID641272480 
Producthypothetical protein 
Protein accessionYP_001537848 
Protein GI159038595 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACCA CCGTGTGCGT CGTCGGCGGC GGACCCGCAG GGCTCGTCCT GGGGCTGCTG 
CTCGCCCGGC AGGGAGTGGC GGTCACCGTG CTGGAGAAGC ACGCCGACTT CCTCCGGGAC
TTCCGGGGCG ACACCGTGCA CCCCTCCACA CTGAACATGC TCGACGAAAT CGGGCTTGGC
GAACGGATGG CGGCGTTGCG CGGGCGCAAG GCCGGCGCGT TGCGCGCCAC CTTCGACGAC
GGCACGTACG CCATCGTGGA CTTCACCCGC CTGCCGGTGC CCCACAACTA CCTGTACTTC
GTCCCCCAGT GGGACTTCCT GGAAATGCTC GCGACCGAGG CGGCGAGACT TCCGACCTTC
ACGCTGCTCC GCTCCGCCAC CGTGACGGGC CTGCTCCGCG ACGAGTCCGG CGCTGTCGCC
GGCGTTCGGG CCGTGGGTCC GGAGGGCGAA CTGGAGATCC AGGCGTCGCT CACCGTCGCC
TGCGACGGCC GAGATTCGGC GGTACGCCGG GAACTCGGCC TGAAGCCCGT CGAGTACGGC
GCACCCATGG ACGTACTGTG GTTCCGGATC TCGCGCCAGG CAGACGACGG CGACGGCCTG
GCGATGCGGA TCGGCGCCGG AGGGCTGATG CTCGCCGTCG ACCGCGGCGA CTACTACCAG
TGCGCTTACG TCATCGCCAA GGGCGGCTAC GACAAGATCC GCGCAGCCGG GCTGGAGGCG
CTGCGGAAGC AGGTGACCCG GCGACACCCG ACCCTCGCCG ACCGGGTCGG CGAGCTCGCC
ACCTGGGACG ACGTCAAACT GCTGACGGTG AAGGTCAACC GGCTCAAGCG GTGGCACGCA
CCCGGCGCGC TGCTCATCGG CGACGCCGCG CACGCCATGT CCCCGATCGG CGGCGTCGGC
ATCAACCTGG CAGTACAGGA CGCCGCGGCC ACCGCCCGGA TGCTGGGTCC AAAGCTCGCC
ACCGGGCAGC CAGTGACCGA AGCGGACCTC GCCGCAGTGG AGAAGCGCCG GCGTTTGCCG
GCGGTGGTGA CGCAGAACAT CCAGCGTGCC GCGCAGCGAC GCGTCGTCGA CCCCCTGCTG
CACACCACCG GCCGGGTCGA GGCCCCGGCG CCGATCCGCC TGCTGCAGCG GATCCCGGCG
TTGCAAGCCC TCCCCGCCCG ACTCGTCGGC ATCGGCGTAC GCCCCGAGCA CCTACGCTGA
 
Protein sequence
MKTTVCVVGG GPAGLVLGLL LARQGVAVTV LEKHADFLRD FRGDTVHPST LNMLDEIGLG 
ERMAALRGRK AGALRATFDD GTYAIVDFTR LPVPHNYLYF VPQWDFLEML ATEAARLPTF
TLLRSATVTG LLRDESGAVA GVRAVGPEGE LEIQASLTVA CDGRDSAVRR ELGLKPVEYG
APMDVLWFRI SRQADDGDGL AMRIGAGGLM LAVDRGDYYQ CAYVIAKGGY DKIRAAGLEA
LRKQVTRRHP TLADRVGELA TWDDVKLLTV KVNRLKRWHA PGALLIGDAA HAMSPIGGVG
INLAVQDAAA TARMLGPKLA TGQPVTEADL AAVEKRRRLP AVVTQNIQRA AQRRVVDPLL
HTTGRVEAPA PIRLLQRIPA LQALPARLVG IGVRPEHLR