Gene Sare_4056 details


Gene Information

Locus tag: Sare_4056
Symbol:
ID: 5704139
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4613574
End bp: 4614617
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 70%
IMG OID: 641273482
Product: fatty acid desaturase
Protein accession: YP_001538837
Protein GI: 159039584
COG category: [I] Lipid transport and metabolism
COG ID: [COG3239] Fatty acid desaturase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.403518
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.268504
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCGGC AACTCGTGGC GGATCCGCCG GTACGGGGCA GTGACTACGC GGAGTTGTCT 
CGGCGGATCC GTGCGGCAGG TCTGTTGGAG CGCCGGCCAG GCCGGTACGC GGTTCGGATC
GCATCGACCG CCGCGTTCTT CGGCGCGGCC TGGGCCGTGG TCGTGCTCGT CGGCGACTCC
TGGGGCCAGG CCGCGGTCGC GGCGCTGATG GCGGTGGCCA GCACCCAGGT GGCCTTCCTC
GGCCACGACG CCGGGCACCG GCAGATGTTC CGCCGGCGTG GGCCGAGCGA ACTGGTCGGC
CTACTCGCCG GCAATCTGGC GGTCGGGCTC AGCTACGGAT GGTGGGTGGA CAAGCACAAT
CGGCACCATG CCAATCCCAA CCATGCCGAC GAGGACCCGG ACGTGGGGGC GGGCGCGTTG
GTCTGGACGA CGGAACAGGC GCGGGCGACC CGGGGATTCG CGCGATGGCT GGCCCGTCGG
CAGGCCGTTC TCTTCTTCCC GATGCTGCTG CTGGAAGGGC TGAACCTGCA CGTGTCGAGT
CTGCGCGCCA TCGTCGGCCG GGAGCCGGAC GGGCGGTTCC GGACGCCGAT GCGGCACCGG
GGTGTCGAGG CCCTGCTGCT CGCCGTGCAC ACCATCCTCT ACGCCGGTGG CCTGCTGCTG
GTGATGTCGC CGGGGAAGGC ACTGGTGTTC GCGGCCGTGC ACCAGGGACT GTGGGGGCTG
TACATGGGGT GTTCCTTCGC CCCGAACCAC AAGGGTATGC CGATGCCGAC CGTTGCGGAC
GACCTGGACT TCCTGCGTAA GCAGGTGATC ACCGCACGTA ACGTGCGAGG TGGCCGAGGT
GTCGACGCCG CGCTGGGCGG GCTGAACCTT CAGATCGAGC ACCACCTGTT CCCGAACATG
CCGCGCGGGA ACCTGCGCCG GGCCCGACCG GTTGTCCGGG CCTACTGCGC CGAGCGGGGC
GTTCCGTACG TGGAGGTCGG GTTGGTCGAG TCGTACCGGC AGGCGCTCGC GCACCTGCAC
GCAGTCGGCC GCCCACTACG TTGA
 
Protein sequence
MTRQLVADPP VRGSDYAELS RRIRAAGLLE RRPGRYAVRI ASTAAFFGAA WAVVVLVGDS 
WGQAAVAALM AVASTQVAFL GHDAGHRQMF RRRGPSELVG LLAGNLAVGL SYGWWVDKHN
RHHANPNHAD EDPDVGAGAL VWTTEQARAT RGFARWLARR QAVLFFPMLL LEGLNLHVSS
LRAIVGREPD GRFRTPMRHR GVEALLLAVH TILYAGGLLL VMSPGKALVF AAVHQGLWGL
YMGCSFAPNH KGMPMPTVAD DLDFLRKQVI TARNVRGGRG VDAALGGLNL QIEHHLFPNM
PRGNLRRARP VVRAYCAERG VPYVEVGLVE SYRQALAHLH AVGRPLR
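
The fields in this record can be cross-checked against one another. The Python sketch below is illustrative only and is not part of the source database: the function names (clean, translate, gc_fraction) are hypothetical. It assumes the coordinates are 1-based and inclusive, that the sequence contains only A, C, G and T, and that the codon assignments of translation table 11 match the standard genetic code (table 11 differs in which codons may serve as start codons; this gene begins with ATG).

```python
# Illustrative sketch only; names are hypothetical, not from the source database.
# Codon assignments of the standard code in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the record's 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# Coordinates from the record, assumed 1-based and inclusive.
start_bp, end_bp = 4613574, 4614617
gene_length = end_bp - start_bp + 1      # 1044
protein_length = gene_length // 3 - 1    # 347 (the stop codon is not counted)

# First 30 and last 24 bases of the gene sequence listed above.
first_codons = translate("ATGACGCGGC AACTCGTGGC GGATCCGCCG")  # "MTRQLVADPP"
last_codons = translate("GCAGTCGGCC GCCCACTACG TTGA")         # "AVGRPLR"
```

Passing the full gene sequence to translate should reproduce the listed protein sequence, and gc_fraction on the same input can be compared with the reported GC content.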