Gene Sare_1556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1556 
Symbol 
ID5706758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1790675 
End bp1792309 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content74% 
IMG OID641271067 
Producthypothetical protein 
Protein accessionYP_001536443 
Protein GI159037190 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0139202 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCTCCG GCTTCGGTGA ACTGACTGAC CACGCGCATC ATCTGGTGGC TACCGGCGAC 
CTCGCCGGCG CCCAGCGGCT GCTCTCCGAC GCGCTGACCG ATGCCGATCC ACGGCCTGCC
CATGCCAGCG CCGAGTTGGC CGAGTTGGCG AGTCTGCAGG CGCGGGTGCT GGTCGCGCTC
GGCGACCCGC AGTCCGCGCG GGGCTGGGCG GCCTACGCGT ACGCGGCCAG CAACCACCTG
CACGGCCGTT CGGACGAACG TACGGTCGCG ACGGCCGCCA CCCTGGCCGC CGTGCTGCAC
CGGGTCGGTA GCTGGTCCCG GGCGGCGCGG GTCTACCAGG AGGTCATCAT CGAGCTGACC
GCCTTGGACG GCCCCGAGTC GCTGCGCGTA CTCGCCGCGC ACGCTGACCT GGCCACGATG
GAGTACGCCC GTGGCCACTG CCAGGCGGCC CGTGACCGGC TCGCCGACGC GTGGGAGCTG
CACCGCGAGG TGTACGGCGA TGGGCATCCC AGTGGCATCA AGATGCTGGC CCGGCTCGGC
GCGATGCAAC GCGACTGTGG GCTGTCCGGT TCCGCGCACG AGAGCCTGGC GCTGGCCGGG
GAACTGTGCC GGCAGCACCT GACCGCGGAC GACCCGCTCG CGGTGCAGGT TGCCGCGCTC
GGGCGGGCGG CGGCCGATCC GGCGCACAGC TGCGCCGGCG TCACACCGGA CGGGCGGGAG
GCTCCGATCG TGCCGGCCGC CCGCACGCCC CCGCCAGGGG ACGTGCCGCC CTACGATGCC
GAGCCGCAGC ATCCGTCGCA GCCCGGATAC CGACCCGCCG ACCCGTACCT GCCGCCGGAG
CCGGACCGGC CGGTGGTCAA GGAGCATCCG GCGGCCGGGT ATTCCCCGCC GCTCACGGTC
CCGACGCCCC GGCAACCGGT GGACGGTGCC GCGGTCGAAG AGTCGACCGA GTCACCCGAG
CAGGGTTCGG GCGGGGCGGA GCACGACCCG TGGCGGCGTG AGCCGTCGGC CGAGGAGTGG
GGCACCGCCG TGCCCCCGTC GGTGCTGCCG CTGGCTCACG GCGACGACGG TGGGCTGTCG
GGCTGGAGGG ATCTGGAGGA GGCCGACGGG GTCCGTCGGG TCGCGCCGCG GGAGACGCCC
GACGAGCCGG CGGACCTGCC GTCGCGGCTG CTGCCGGTGC CGGTGCCTCG TGCCTCGCCG
CCGTCTCGCA AGCGGCTGTT GCTGCTCGTG GCGGGTGGTG TGGTGGTGCT GCTGGGGACG
CTCGCGGTGA TCGCGGGGGT GTCCCGCTTC GCTGGGGCAG CGTCGGTGGC GACCAGCCCA
CCGGCCCAGG TCACCGCTAC TCCCGCCGCA TCCGCGGCGG CTGCCGGCAC CCCACCCGGT
GAGCTGACTC TGAGTGACAA CCAGGACAGT GTCGCGCTGC GTTGGACGTA TCCGGCGGGG
GGTGAGGGTC CGGTGGTGGT CTCGGGTGGC CAGCCCGGCC AGCCGCAGAC CGTTTTCGCC
AACCTACCCG CCGGCACCAC CGACTTCGTC GTGTACGGGC TCAACGGTGG CGTCGACTAC
TGCTTCGCCG TGGCGGTGGT CTGGTCGACG GAGACGATCG CCCGGTCGGG GGAGGTCTGC
ACCAACCGCG GGTGA
 
Protein sequence
MPSGFGELTD HAHHLVATGD LAGAQRLLSD ALTDADPRPA HASAELAELA SLQARVLVAL 
GDPQSARGWA AYAYAASNHL HGRSDERTVA TAATLAAVLH RVGSWSRAAR VYQEVIIELT
ALDGPESLRV LAAHADLATM EYARGHCQAA RDRLADAWEL HREVYGDGHP SGIKMLARLG
AMQRDCGLSG SAHESLALAG ELCRQHLTAD DPLAVQVAAL GRAAADPAHS CAGVTPDGRE
APIVPAARTP PPGDVPPYDA EPQHPSQPGY RPADPYLPPE PDRPVVKEHP AAGYSPPLTV
PTPRQPVDGA AVEESTESPE QGSGGAEHDP WRREPSAEEW GTAVPPSVLP LAHGDDGGLS
GWRDLEEADG VRRVAPRETP DEPADLPSRL LPVPVPRASP PSRKRLLLLV AGGVVVLLGT
LAVIAGVSRF AGAASVATSP PAQVTATPAA SAAAAGTPPG ELTLSDNQDS VALRWTYPAG
GEGPVVVSGG QPGQPQTVFA NLPAGTTDFV VYGLNGGVDY CFAVAVVWST ETIARSGEVC
TNRG