Gene Sare_5044 details


Gene Information

Locus tag: Sare_5044
Symbol:
ID: 5707315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5711135
End bp: 5712244
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 75%
IMG OID: 641274437
Product: hypothetical protein
Protein accession: YP_001539778
Protein GI: 159040525
COG category:
COG ID:
TIGRFAM ID:
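The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields for Sare_5044.
start_bp, end_bp = 5711135, 5712244

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1110
print(protein_length_aa(length))  # 369
```

Both results agree with the listed Gene Length (1110 bp) and Protein Length (369 aa).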


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0151406
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCAA CCGACCTACC CCGGCTCTCC GTTCGTTCCT GCGCGGACCT GATCGCCGCC 
GTGCCCTACC TGCTCGGGTT CCACCCGGCC GACAGCGTGG TCGTGGTGGC CATGCATGGG
ACACGGATCA CCTTCGCCGC GCGGGCGGAC CTCCCAGACC GGGACGGGTC CGCCGATCCG
AGCGGGTCCG CCCGGCCGAA CACCGAGTCT GCCGACCCGG GTGGATCGGA CGAGCGCGCC
GCGGCGGCAC GGCACCTGGC GGCGGTGGTC GCGCGGCAGA CGACCGACCG GGCGACCGTG
CTCGGCTACG GACCCGCGTC CCGGGTCACC TGTGCGGTGG ACGCTGTTCG GCAGGCCCTC
ACCGAGGCCG GGATCCTGGT GCTCGACGCG CTCCGGGTCA CCGATGGTCG CTACTGGTCG
TACCTCTGCC AGGCGCCGGC ATGCTGCCCG CCCGACGGCA CTCCCTACGA CTCGGGTACG
AGCCAGGTGG CCGCCGCCGC GGTTCTCGCC GGTCAGGTCG CCCTGCCCGA CCGGGCCGCC
CTCGTCGCGC AGGTGGCACC GGCAGGGGGT ACCGAGCAGG TTCGGCTGCA GCGGGCCGCC
GAGCGGGCGC GGCGGCGGTT CGCCGGACTG GTGACCCCGA GGACCGGGGG CGACGTTCCC
CGCGGGCGGG CGGTGCGGGC AGCGGGGAAC ACCGCGATCC GGGCCGCGCT GCGCCGATAC
CGGCGGGGCG AACGGCTCGA CGACGACGAG GTGGCCTGGC TGAGCCTGCT GCTGACCGAC
CCGACGGTCC GGGATCTCGC CTGGGAACGC ACCGATGGGC GAGACGCCGA CAAAGCTCTC
TGGGCCGACG TGCTCCGCCG GGCGCAACCG GACCTCATCG CCGCGCCCGG TTGCCTGCTG
GCATTCGCGA CGTGGCGGGC CGGGCACGGG GCGCTGGCGG TGGTGGCGGT GCAACGGGTG
CTCGCCCAGC AGCCCGATTA CCCGCTCGCG CTGCTCCTGG ACGACCTGCT TCGGCGTGGC
GTGCCGCCGA CGCGCCTGGC CGGATGGCCT GCCGTCCAAC TGCCCGGTGC GGTTCGTCCC
CGCCGTCGAC GCGGGCGCGG TGCCCGCTGA
 
Protein sequence
MTSTDLPRLS VRSCADLIAA VPYLLGFHPA DSVVVVAMHG TRITFAARAD LPDRDGSADP 
SGSARPNTES ADPGGSDERA AAARHLAAVV ARQTTDRATV LGYGPASRVT CAVDAVRQAL
TEAGILVLDA LRVTDGRYWS YLCQAPACCP PDGTPYDSGT SQVAAAAVLA GQVALPDRAA
LVAQVAPAGG TEQVRLQRAA ERARRRFAGL VTPRTGGDVP RGRAVRAAGN TAIRAALRRY
RRGERLDDDE VAWLSLLLTD PTVRDLAWER TDGRDADKAL WADVLRRAQP DLIAAPGCLL
AFATWRAGHG ALAVVAVQRV LAQQPDYPLA LLLDDLLRRG VPPTRLAGWP AVQLPGAVRP
RRRRGRGAR
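The two sequence blocks can be cross-checked against each other. The following is a minimal sketch, not the database's own pipeline: it strips the 10-base grouping whitespace, translates with the amino-acid assignments of translation table 11 (which match the standard code; only the permitted start codons differ), and computes a GC fraction. All function names are hypothetical, and only the first 60 bases of the gene sequence are used as example input.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in TCAG codon order (standard code; table 11 is the same).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence as shown above.
first_line = "ATGACCTCAA CCGACCTACC CCGGCTCTCC GTTCGTTCCT GCGCGGACCT GATCGCCGCC"
print(translate(clean(first_line)))  # MTSTDLPRLSVRSCADLIAA
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `clean`, `translate`, and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.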