Gene Sare_3013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3013 
Symbol 
ID5707354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3426344 
End bp3427444 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content69% 
IMG OID641272460 
Producthypothetical protein 
Protein accessionYP_001537828 
Protein GI159038575 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3320] Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0103839 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGC CGAAACGGAT CCTGGTCACC GGGGCGGCCG GCCTGGTCGG CGCCGAGGTC 
TGCGCCCGCC TGCTGAAGGC CGGGCACCGC GTGTCCGGCC TGGTGCACCA CACCCGAGTG
CTGATCGCCA ACAGCGGCCG GGGCGTCGCG TCGTCGACGG ACGGCAGGGC GGGCACCGTG
CGGCTGGTCA CCGGTGACGT CACCGTTCCA CGCCTGGGCC TCGACGACGA CACCTGGACG
GACCTGGCGA ACGGGCTCGA CCTGATCGTG CACTCTGCTG CGATCACCGA CTTCGGTCAC
CCCCGGGAGG TCTACCAGGC AATCAACACG ACCGGCACGG CCCACGTGCT GGAGCTGGCC
CGCGAGCGAT CCACCCCGCT GGTGCACGTC AGCACCGCCT ACGTGTGCGG CGAGCGGGAC
GGCAGGATCC TGGAGTCCCA ACTCGACGTC GGGCAGCGGT TCGGCAACCC GTACGAGGAG
AGCAAACTCG CCGCGGAGCA ACTGGTGCGA AAGGCCGCCG CGGAAAGCCT GCCGACCGCG
GTGATCCGGC CCAGTGTCGT CGTCGGTGCC GCCCGCACCG GCGCGGTCCG TGACTTCAAG
AACCTGTACG TCGTGCTGAA GCTCCTGTCC GAGGGTCGAA TCGGCTCGAT CCCGGGCTAC
TTCGACGCCT GCGTCGACCT TGTGCCGGTC GACCATGTGG CGGCGTTGAT CACCGCCGTG
GCCGACGACT TCGACCGGGC CAGCGACCGC ACACTGCACG CGGTGGGATC GTCGGTGCGG
ATGCGCCACA TCTCCGATGT GCTCGCCGAG TACCCGTCGT TCCACGTTCC CCGCTACATC
GCCCCGGCCA ACTTCGACCC TGCCATGCTG TCGGACCTCG AACGCGCCTA CTGGCACCGG
GTCATGTCGC TGTACGAGAG CTACTTCCGC CGCCAGCAGC ACTTCGACGA CACCGTCGCG
GCTGGGTTCC GGGGGCCCAA GCGACTACCC AGCGGTCCGA ACCACCTCCG GCGGATCATC
GACTACGCGG TCAGGGCCGG CTACCTGGGC GCGCCCCTAC CGGGCGTGAC GGAGGCCCTG
AGCCGGGTCA ACCAGCGGTG A
 
Protein sequence
MSKPKRILVT GAAGLVGAEV CARLLKAGHR VSGLVHHTRV LIANSGRGVA SSTDGRAGTV 
RLVTGDVTVP RLGLDDDTWT DLANGLDLIV HSAAITDFGH PREVYQAINT TGTAHVLELA
RERSTPLVHV STAYVCGERD GRILESQLDV GQRFGNPYEE SKLAAEQLVR KAAAESLPTA
VIRPSVVVGA ARTGAVRDFK NLYVVLKLLS EGRIGSIPGY FDACVDLVPV DHVAALITAV
ADDFDRASDR TLHAVGSSVR MRHISDVLAE YPSFHVPRYI APANFDPAML SDLERAYWHR
VMSLYESYFR RQQHFDDTVA AGFRGPKRLP SGPNHLRRII DYAVRAGYLG APLPGVTEAL
SRVNQR