Gene Sare_1349 details

Gene Information

Locus tag: Sare_1349
Symbol:
ID: 5704276
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1556886
End bp: 1558520
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 76%
IMG OID: 641270860
Product: hypothetical protein
Protein accession: YP_001536241
Protein GI: 159036988
COG category:
COG ID:
TIGRFAM ID:
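
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1556886
END_BP = 1558520
GENE_LENGTH_BP = 1635
PROTEIN_LENGTH_AA = 544


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported gene length and protein length are consistent with the coordinates.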


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0457672
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00027921
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACCGGGC TGCCCGGGGG TCGAGGGCCC GACGAGTTCG GCTGGCCGGG TTGGCCACCG 
CGTCGCCAAC GAGCCGGGCC TGACGGCCGC CCACTCGCCG AGGAGCTGGC CGAGGCCCGG
GACTCCCCCG ACGGCGAGGC CCGCAGTGCC GAGCTGGAAC GGATCGCCGC GCGGGCCGAC
GCCACCGGCG ACGACCGGTC GGCGCTGGAC GCCCGGCTCG CGTTGATCGA GACCTACCTG
CTCGACGGTG AACGCTGGCG CCTGGTCGAG CCGGTCCGCC GCTGCCTCGC CGCCGTCGAC
CGTCGACCCG ACCTGCTGCC CTCCGGCGGC CACGAAACCC TGTCCCGCTA CCGGCGGTAC
GCGGTCGAAG CCCTGCTCGG CAGCCCGCGC GGTGGGCTGG ACCAAGCCCG GGCACTGCTC
GACGACTTCG GAGCCGCCGA CAGCGCGACC GCCGCCGAAC TACACTGCCG GATCGCCGAC
CACCTCGGGG ACGAGCCGAC CGCCCGGTCC TGGTACGACC GCTGGGCAGC GGCGCCGGCC
GCACCAACCG CCGGTTGCGT CGGCTGCGCC GCCGTCCGGC GAGCGGAGCT GCTCGCCGGC
TGGGAGGACT GGTCCGCCGC GCTGGACGTC CTGGCCGACG TCGACCCGGA CGGCTGCACC
GGCGCACCCG AGCGAGGGCT CGCCGCGGGG ATGCTGCCGT GGCTGCGGGT CGGCGCGGCG
GAGCAGGCCG CGGCGGCGCA CGTGCGCGCG TATCAGCGGC ACCAGCACGA ACGGGCCGGC
TTCGCCTACC TCGCGGCGCA CCTGCGCTTC TGCGCGCTCA GCGGGAATCC GGTCCGCGGG
CTGGCCATCC TCGCCGCGCA GCTACCCAGG CTCGACGGGG CCCACGACGA GCTGGCCGCG
ATGGAGTTCG CCGCCGCGGG GGCGCTGGTG TGCGGGCTCG CGGTCGCGGC CGACCTCGGC
GAGCAACGCA TACACCGCCC GGTGTACGGG CAACGGCCGG CGGCCGACCT CGACGTCACC
ACCCTCGGCG CCACGTTGAC CGACCTGGCC ACCGGGATCG CCGGTAGCTT CGACGCGCGC
AACGGCACCG GCCACCAGTC CGGACGCGTC GCGTCCTGGC TGGACGAGCG GGGGACAGCC
GAACGGGCGC TGGCCGATCC GGTACCCCTA CCGTTCGAGG GCCCGGACGA CGACCGGCCC
GACAGCGACC CGGACGACCC GCCGGAGGAC CGGCCCGTCC CGCTGAGCCT GGCAGCGATC
ACCACAGCGC TGGACCGACG CGGCGACCAG TACGTGCTGG ATGCCACGGA CACCGTCGTC
GGACGCTGGG GCGAGGCACT GATCCAGATG CGGCGGGCGG GCGAACGCGG CGAGGTCCTC
CATGTCCGCG CCGTGGCCAG CCGCCGGCTG CCAGCCGACC GACGCGCCGA GGCGTACGCG
TTCTGCAACG CCTGGAACCA GGACCGGCTG CTGCCCACGG CGTACGTGCA CGACTCCGGG
GCCGAGCTGG TGCTGGCCGG CGACGTCAGC ACCGACCTGT CCCACGGGGT GGCGCCCACC
CAACTGGATG TGCTGCTGGC CGCTGCGGTG CGCACCGGCT CGGCGTACGC CGACGCGGTC
GCCGAGCTTC CCTGA
 
Protein sequence
MTGLPGGRGP DEFGWPGWPP RRQRAGPDGR PLAEELAEAR DSPDGEARSA ELERIAARAD 
ATGDDRSALD ARLALIETYL LDGERWRLVE PVRRCLAAVD RRPDLLPSGG HETLSRYRRY
AVEALLGSPR GGLDQARALL DDFGAADSAT AAELHCRIAD HLGDEPTARS WYDRWAAAPA
APTAGCVGCA AVRRAELLAG WEDWSAALDV LADVDPDGCT GAPERGLAAG MLPWLRVGAA
EQAAAAHVRA YQRHQHERAG FAYLAAHLRF CALSGNPVRG LAILAAQLPR LDGAHDELAA
MEFAAAGALV CGLAVAADLG EQRIHRPVYG QRPAADLDVT TLGATLTDLA TGIAGSFDAR
NGTGHQSGRV ASWLDERGTA ERALADPVPL PFEGPDDDRP DSDPDDPPED RPVPLSLAAI
TTALDRRGDQ YVLDATDTVV GRWGEALIQM RRAGERGEVL HVRAVASRRL PADRRAEAYA
FCNAWNQDRL LPTAYVHDSG AELVLAGDVS TDLSHGVAPT QLDVLLAAAV RTGSAYADAV
AELP
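
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the codon assignments of translation table 11, which match the standard genetic code, and it treats an ATG, GTG, or TTG first codon as methionine. That start-codon set is a simplified subset chosen for illustration. The helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical, and only the first and last lines of the gene sequence are used as inputs.

```python
from itertools import product

# Standard codon assignments, bases ordered T, C, A, G (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplified assumption: only these first codons are read as Met.
START_CODONS = {"ATG", "GTG", "TTG"}

# First and last lines of the gene sequence, copied from this record.
FIRST_LINE = "GTGACCGGGC TGCCCGGGGG TCGAGGGCCC GACGAGTTCG GCTGGCCGGG TTGGCCACCG"
LAST_LINE = "GCCGAGCTTC CCTGA"


def clean(seq: str) -> str:
    """Remove the spacing used in the record's 10-base blocks."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        aa = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons are read as Met at position 1
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    print(translate_cds(FIRST_LINE))  # MTGLPGGRGPDEFGWPGWPP
    print(translate_cds(LAST_LINE))   # AELP
```

The two outputs match the first 20 residues and the final 4 residues of the protein sequence listed in this record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.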