Gene Sare_4537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4537 
Symbol 
ID5705978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5129715 
End bp5131535 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content69% 
IMG OID641273951 
Producthypothetical protein 
Protein accessionYP_001539300 
Protein GI159040047 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.483456 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0103293 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACAG CCGATTCGAG CCGCGCGGCC GACGCTCCCG AGGGCGGGGC GTCGACCACC 
GAGCCTGACC AGCGAAACCA GGCCACCACA CGGTGGCGGG TCGATCTGCT GGCGGTGCTG
AGCTTCCTCG CGCTCGCACT CTGGGTGACC CTCCGGCTCT GGCTGGACCC ACGCGACGGG
CTCCGGGACA ACCGCACCGA TCAGGCGCAG TTCGAGTGGA TGATGGCGCA CGGTTCACGA
GTGGTGACCG ATTTCGCCTA TCCCTTCGCC TCGGATCGAA TGAACGTGCC CGAGGTCGTC
AATTTGATGG CCAATACGTC CGTATTATCT GTATCTATAC CAATGACGCC GGTCACCCTT
GTGGCCGGAC CCCGGATGTC CTTCCTGCTC TTTCTCACCC TGGGGATGGC CGCCACCGCA
ACATCGTGGT ATTTCCTGCT GTCCCGGGTG GTGGTCCGGT CTCCCGGCCC GGCCTGGCTC
GGCGCCACAT TCTGCGGGTT CGCGCCCGCC ATGGTCTCGC ACGCCAACGC CCACCCCAAC
ATCGTCTCCC AGTTCGTGGT GCCACTGATC ATCTGGCGTA CCCTGCGCCT CGGTGAGCCG
GGCCGCTGGC TACGCAACGG GCTGCTGCTC GCCCTGGTGA TCGTCTGGCA GGCATTCCTC
AACCTGGAGA TCCTGCTGAT GACCGCGATC GGCCTCGGTG TGGTCATCGT CGCGCTTGCC
CTCGGTCGAC CCGACCTACG CCAGCGGACA CGCCCGTTCC TCGCCGGGCT GGGCGTCGCC
GCGGGAGTCA CGCTCGTCCT GCTGGCGTAC CCGCTGTACG TACAGTTCTT CGGTCCCGGC
GCCTACCGGG GGCTGTCACC CCTCATCCGC GGCTACTCCA CCGACCTCGC CTCGTTCGTG
GCGTACTCCC GGGAGTCGCT GGCCGGCGAC GAACCCGGTG CGAGAGGGCT GGCGAAGAAC
CCCACCGAGG AGAACGCCTT CTTCGGCTGG CCCCTGTTGG TGCTCGTCGC CGCACTCGTC
TGGTGGCTGC GCCGCAACGT CGTCGTCCGG GCCCTGGCCC TGCTCGCGGT GGTCTTCGCC
GTGCTCTCGC TCGGCCGGGA AGTCCTGTTC AACGGCGAGG CCACCGGTAT ACCGGCTCCC
TGGGCGATAC TGGAAACCCT GCCGATCCTG CACTCGGTGG TACCGACCCG CTGGGCCCTG
GCCATCACCC CGGTGATCGG GCTGCTGCTC GCGTACGGGG CACAGCACGC CCGCACCCTC
GCCACCCGGA ATCCGTCCGC CCGGCCACAG ATCCGCTTTG CCACGGTCAC CGTACTGGCG
ATGGCGCTCC TGCCGCTCCT GCCGACCCCA CTGCCGGCGG TCCGGCTGGA GCCCACGCCC
GCCTTCGTCA CCTCTGGCGC ATGGCGCCCC TACGTGGCCG GTGGTCGCAG CATCGTCACC
CTGCCGCTGC CGGACACCCA CTACGCCGAC CCGCTGCGCT GGTCGGCCGA GACAGGTCTG
GAGATGCCGA TCGCCCGGGG GTACTTCCTC GGCCCGGACA CCCGCCCCGA CCGGCACCGC
GTCGCCCTGT TCACCGCCCC AGACCGCCCG ACCAGCGACT TCTTCACCGA AATTCGGCGT
ACCGGTGAGG TGCCACCAGT CAGCCAGCAG GAACGAACGG CCGCTGAGGA CGACCTGCGG
TACTGGCGGG CCGGCGCGGT CGTGCTCGGT CCACACCGGC ACGCGGACGC GCTACGCCGC
GGCATGACCG AGCTGATCGA GGTCCAGCCG ACCTACACCG GGGGCGTCTG GCTCTGGGAC
GTGCGACACC TCACCGACTG A
 
Protein sequence
MTTADSSRAA DAPEGGASTT EPDQRNQATT RWRVDLLAVL SFLALALWVT LRLWLDPRDG 
LRDNRTDQAQ FEWMMAHGSR VVTDFAYPFA SDRMNVPEVV NLMANTSVLS VSIPMTPVTL
VAGPRMSFLL FLTLGMAATA TSWYFLLSRV VVRSPGPAWL GATFCGFAPA MVSHANAHPN
IVSQFVVPLI IWRTLRLGEP GRWLRNGLLL ALVIVWQAFL NLEILLMTAI GLGVVIVALA
LGRPDLRQRT RPFLAGLGVA AGVTLVLLAY PLYVQFFGPG AYRGLSPLIR GYSTDLASFV
AYSRESLAGD EPGARGLAKN PTEENAFFGW PLLVLVAALV WWLRRNVVVR ALALLAVVFA
VLSLGREVLF NGEATGIPAP WAILETLPIL HSVVPTRWAL AITPVIGLLL AYGAQHARTL
ATRNPSARPQ IRFATVTVLA MALLPLLPTP LPAVRLEPTP AFVTSGAWRP YVAGGRSIVT
LPLPDTHYAD PLRWSAETGL EMPIARGYFL GPDTRPDRHR VALFTAPDRP TSDFFTEIRR
TGEVPPVSQQ ERTAAEDDLR YWRAGAVVLG PHRHADALRR GMTELIEVQP TYTGGVWLWD
VRHLTD