Gene Sare_4838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4838 
Symbol 
ID5707743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5487475 
End bp5488881 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content69% 
IMG OID641274234 
Producthypothetical protein 
Protein accessionYP_001539579 
Protein GI159040326 
COG category 
COG ID 
TIGRFAM ID[TIGR02958] secretion protein snm4 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000246371 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAGCACCG TACTGGATGG ACGGCTGTGT CGAGTCACCG TGACCGGCCC GGATCGCAGG 
GTCGACCTGG CAGTGCCGGT GACGACTCCG GTGGCGACGC TGCTGCCGGT GTTGTTGGGG
CACACCTCGG AGGGACACCG CCTCGAGGGC GACACACCCG AGGCCGCGTG GGTCCTGCAG
CGTCTCGGCC AGGAGCCCTT CGAGCTGTCC GGTACGCCGG AGAGTCTCGA CTGGTTGGAA
GGCGAGGAGC TGTATCTGCG CCGGGCTGAG GACCCACTGC CCGAGCTGGA CTTCGACGAC
GTCGCGGAGG GCATCGCGAC CGTCGTCAAC CGACGTGGGG ACCGATGGCA GCCGGAGTAC
CGGCGGGCGC TGTTCCTCCT GCTGTCGGTC GTCGCGATGG GTGCGATCGC CGTGCTCCTG
GCCGATTGGC GTCCGGTGCC ACACCAGGTG GTGGCCGCCG GAACGGTTGG CGTGGCCTTC
CTCGCCGCCG CGCTCCTCTT CGCGCACAGG CACAGCGACG GCGCCTTCTC ACTCCTCTTC
GGTGGTGGTG CGGCGGCGTT CGCGGCGCTG GCCACGTCCA GCGCGGTCGA CGGGGATCCG
CAGGGGATAG CGATCCACGG GTCGTCGGTG CTGGCAGCGG CGGTCGGCGC GGTCGTCGTC
TCCGGGGTGC TGGTCGTGGC CCAACGCACG GTCAGTCCGT ACCTGCCGTT CACTCCGATG
CTGGTGGTCG GCGTGGCCGC GTTGATGGCG CTCGGGGTGC TGCTGCTTCA GTCCGGGTCC
GGGATGTCGG TGCAGCGGAC GGCGGCTGCG GCCGCCGCGG TGGTGTTCGC CGCTGTCGTG
GTCGCGCCAC GGGTGACGGT GAAGTTCGCC CGGCTGCGGG GCCCGCAGCT ACCGAAGACC
GGCGCCGACA TGTCGTACGA CATCGAGCCG GTGCCGTCCG AGACCGTCGG CGAGCGGACC
AACGACGCCG ACACCTACCT CAGCGTGCTG ATGCTGGCAT CGGCGCTGGT ACTGCCCATT
CTGTTCCATC TCACGCTCCA GGAACCGGGC TGGACGGGCT GGACCTACGT GTTCGTGATC
GCCAGTGCAG TCTTTCTGCG CGCGCGTACC TTTCTCGGTC TCTGGCAAAG GGTTCCGCTC
ACTGTCGCCG GTACCGTCGG CTACCTCATG GTCATCATGC ACCTCTCCCA GACGATGTCC
GTCGGCTGGC GGTGGGCTCT GCTGGGCAGT CTGGTGACGG TGGTGATCCC ACTGGTGCTG
GCGGCCCTGC GGCCCTGGCC CCGCCGAATG TTGCCGTTCT GGGAGTACAC GGCGACGTTC
TTCGATGTGG TGACCGGGGT GGCCGTCCTA CCGATCCTCG CTCAGATCCT CGGGCTGTAC
GCGTGGGCCC GCGGGCTCTT CGGGTAG
 
Protein sequence
MSTVLDGRLC RVTVTGPDRR VDLAVPVTTP VATLLPVLLG HTSEGHRLEG DTPEAAWVLQ 
RLGQEPFELS GTPESLDWLE GEELYLRRAE DPLPELDFDD VAEGIATVVN RRGDRWQPEY
RRALFLLLSV VAMGAIAVLL ADWRPVPHQV VAAGTVGVAF LAAALLFAHR HSDGAFSLLF
GGGAAAFAAL ATSSAVDGDP QGIAIHGSSV LAAAVGAVVV SGVLVVAQRT VSPYLPFTPM
LVVGVAALMA LGVLLLQSGS GMSVQRTAAA AAAVVFAAVV VAPRVTVKFA RLRGPQLPKT
GADMSYDIEP VPSETVGERT NDADTYLSVL MLASALVLPI LFHLTLQEPG WTGWTYVFVI
ASAVFLRART FLGLWQRVPL TVAGTVGYLM VIMHLSQTMS VGWRWALLGS LVTVVIPLVL
AALRPWPRRM LPFWEYTATF FDVVTGVAVL PILAQILGLY AWARGLFG