Gene Sare_3134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3134 
Symbol 
ID5706344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3564386 
End bp3565648 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content65% 
IMG OID641272566 
Producthypothetical protein 
Protein accessionYP_001537933 
Protein GI159038680 
COG category[S] Function unknown 
COG ID[COG4325] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.142583 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000406172 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCTGTGGG CAAGGTTCTT CCGGATCCGC CAGAATCTGG TGGGCAGCCT GTGGTTCTTT 
CCGCTACTCG GGCTGATCGC AGGTGGGCTG TTCGCCCTGC TGGTCCGCGC GGTCGATGCC
GTCGTCACCC TGCCGTCGGC GTGGCATGTC GAGCCGTCCT CGGCGGAGGA ACTCCTCACC
ATCGTGGTCG AGGTTGCCGG TGTTCTGGCC GGATTTGTGG TGGCCGTGAG CGTTGTCGTG
CTGGAACTCC ACGCCGACAG CTTTCCTCGC TACCTACGCC TGTTGTATCG CGACCGATTC
CACCTGTCCG TCGTGACGGT CCTCCTCGGG ACCGTTGCCT TTGCCTTCAC GTTGCTGGTC
ATCGGCGGGA CATCCGAAAT CCCGCAGCTC GGTGTTGGGA TCTCCGGGAT TCTGCTCATC
GTCAGCTTTG CGCTGTTCGT GGTTTTCCTC AACCACCTGG TGCATCAGCT GCGGCCGGTG
GCAGTCGCGG CCGAAGTGAC CAAGACCGCC CAGAAGCTGA TCTTGTCACG TGAGCAGTCG
GGCGCGTCGA TGGCGCCCGA GGAGGAGCCG ACCGAGGAGC CGACACTGGT GATTCGGGCC
CGACGGTCCG GATCGGTGCA GGCAGTGAGC GAGGCAGCCC TGGTGCGGTG GGCGAGCCAG
CGCGACTGCC GCCTCCGATC GAAAGTGGCG GCAGGCGACT ACGTCATCGC CGGACAGCCG
GTGATGGAGA TCTACGGAAA GCGTCCGTCG AGCGATTCGA TCGAACAGAT ACACGCCATG
GTGGCGATGG GGATCGAGCG AACCATCGAG CGGGATCCCG CGTTTCCGTT CCGGATTCTG
GTCGACAGTG CGGCGCGGGC GCTCTCGTCT GCCATCAACG ATCCGACCAC GGCAGTACAG
ATGCTGGACT ACATCGAGGA ATTGCTTCGG ACGATTGCGG CAAAGCCCCT CGGCGCAATG
GCCTACGCCG ATGAGTCCGG CCAGTCACGG CTGGTCATGC CAGGGCGGAC GTGGGAGGAC
TATCTCACCC TGGCGGTCAC GGAGATTCGG GAGTACGGGG CCAGTTCCAT CCAGGTGATG
CGGCGGCTAC GCGCCATGCT CGAAGATCTC CGTGAGGTGA TCCCCGACGA ACGGAGACCG
GCGGTCGAGG CCGAACTCTC CCGGCTTGAT CAAACCCTGG CCGCAAGCTT CGGAGGTCAG
GTCGACCACG ACCGTGCCAC GGTGCCGGAC CGGCAGGGAA TCGGCGGTCC CGGCCGCGGG
TGA
 
Protein sequence
MLWARFFRIR QNLVGSLWFF PLLGLIAGGL FALLVRAVDA VVTLPSAWHV EPSSAEELLT 
IVVEVAGVLA GFVVAVSVVV LELHADSFPR YLRLLYRDRF HLSVVTVLLG TVAFAFTLLV
IGGTSEIPQL GVGISGILLI VSFALFVVFL NHLVHQLRPV AVAAEVTKTA QKLILSREQS
GASMAPEEEP TEEPTLVIRA RRSGSVQAVS EAALVRWASQ RDCRLRSKVA AGDYVIAGQP
VMEIYGKRPS SDSIEQIHAM VAMGIERTIE RDPAFPFRIL VDSAARALSS AINDPTTAVQ
MLDYIEELLR TIAAKPLGAM AYADESGQSR LVMPGRTWED YLTLAVTEIR EYGASSIQVM
RRLRAMLEDL REVIPDERRP AVEAELSRLD QTLAASFGGQ VDHDRATVPD RQGIGGPGRG