Gene Sare_4339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4339 
Symbol 
ID5706615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4904802 
End bp4907264 
Gene Length2463 bp 
Protein Length820 aa 
Translation table11 
GC content75% 
IMG OID641273761 
Producthypothetical protein 
Protein accessionYP_001539111 
Protein GI159039858 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCT CACTCGCCGA CCACCTGCGC GGACTGCCCG ACCAGTCACT CGCGGCACTG 
CTTCAGCAGC GGCCGGACCT CGTCGTACCC GTACCCGCCG ACGTCTCGGC GCTCGCCATC
CGAGCCCAGT CCCGGGTGTC GGTGGCCCGC GCCCTGGACA GCCTGGACCG GTTCACGCTG
CAGGTCCTGG ACGCCGCCCG GCTGACCCAG GACGCCTCCA CCGGCGCCTC CAGCATCGAC
GCGATACTTG CCATGACCAC CGCGAGTGCG CGACCGCCCG CCGCGACCGC CGTCCGCGCT
GCGGTGCAAC AACTCCGGTC GCGCCTGCTG CTGTACGGGC CCGACGAACG CCTGCATCTG
ATCGGCGGAA TCGACGAGAT CTCCCCGTAC CCGGCAGGGC TCGGCCGGCC GGCGGCGGAG
CTGGACGCGC GAGCGGCGGC CGTCTGCGAC GACTCGGCGA AACTTCGGCG TACCCTGCTC
TCGGCTCCGC CGTCTGCCCG GGCCATCCTG GACCGGCTCG CCGCCGGTCC GCCGGTGGGC
AGTGTGCCGC CGGGGGCGTT GCAGGCCCCG CCCATCGGCG CCGAGGACGC GCTCCCGCCG
GACCCCACCA ACGGCGGTAC GCCCACCGGC TCACCGGTGC GCTGGTTGGT TGACCACCGG
CTACTGGTGT CGATCGCCAA CGGTGGCGGC AGCGCACCCG GCACCGTCGA GTTGCCCCGG
GAGGTGGGCC TGCTGCTCCG CCGGGACACC GGCCCACTTG GCCCGATGCC CACCGATCCG
CCACCGGTTT CCGCCGCACC CCGGGAACCG AAGGCGGTCG ACTCGGCCGG GGCCGGTCAG
GCCATGGAGG TGGTCCGCCA GGCCGAGGCA CTGCTGGAGC AGCTCGCCGT GGAGCCGGCC
TCGGTGCTCC GCTCCGGTGG CCTGGGCGTT CGCGACCTCC GCCGGCTGGC CCGCGCCCTC
GGTCTGGACG AGCCGACAGC CGCTCTGCTG CTGGAGGCGT CACACGCGGC CGGACTCACC
GGGGAACTGG AGGTGCATGG GGCAGCGACG ACGCGTCCCG GCGGCGAGCA GCAGGTGGTG
CCGAGTGCCG GGTACGAGGT CTGGCGGGCG GGGTCGCTGG CGCAGCGGTG GGAGCAGTTG
GCCCGGGCGT GGCTGACGAT GACCCGGCAA CCCGGGCTGA TCGGCCAGCG CGACGACCGG
GACCGCCCGA TCACGGTGCT CTCCGCCGAG GCGGAACGGG CTGGGGCACC GATCGCCCGG
CAGGCAGCAC TGAGCGTCCT CGCCGATCTC GAACCGGCGA GCGCGCCCAC CCCGAACGAG
GTGTTGGACC TGTTGGACTG GCGGGCTCCC CGGCGCAGCC GGGGCCGGGA GGCAGCACAC
CGGGAGGTCC TGGCCGAGGC GGCCACGCTG GGCGTGACCG GGCTCGGAGC TCTCACCTCG
TACGGGCGGC TGCTGCTCGG CGACACAACG CCCGGCGAAC AGCGGGGCGT CGATGATCCG
CTGGGCCTGC GCGGCGACAC TGAACCGTCG ACGGCGGTGC GGGCGCTGGA CGGGCTGCTA
CCCGCCCCGG TCGACCACTT CCTGGTGCAG GCCGACCTGA CCGTCGTCGT ACCCGGCCCG
GCCGACCCGA CACTCACGGC CGAACTGGAC GTGGTGGCCG AGCACGAGTC AGCCGGTGGG
GCCAGCGTGC ACCGGGTCAC CCCGGCGAGT GTCCGGCGGG CGCTGGATGC CGGCTACTCC
GCGGACGACC TGCACGGGCT GTTCACGCGG CGCTCGCGGA CCCCGGTGCC GCAGGGGCTC
ACCTACCTGG TCGACGACAC CGCCCGCCGG CACGGCGGCC TTCGCGTCGG CGCCACCGGG
GCGTACCTGC GCAGCGACGA CGAGGCGCTG CTCGGCGAGG TACTCGCGGA CCGGCGGTTG
GAGGCGCTGG TGCTGCGCCG GCTCGCACCG ACCGTGCTTG TCACGCCGTG CCAGGTCAAC
CGGATGCTCG TCACGCTGCG TGAGGCCGGC TATGCGCCGG TGCCCGAGGA CGCCACTGGC
GCGGCGGTAC TCACCCGGCC CAGGTCCCGG CGGGCGCCCG CCCGCGTGCC GGCCGCCACC
CGGATCCTGG ACCCGCTCGC CGTCCCGAAG CTCCCGATGC CCCGGCTGCT CGGCATGGTC
GAGCAGATCC GGCGGGGCGA AGCCGCCGCC AGGGCCGCCC GGCGGGCACC GGCGGTGGTC
CGTGGTCAGG AAGCCGCCGG CGCGGTACAC AGCCACACCG ACGCGCTGGC GGTGCTGCAG
CAGGCGGTCC GGGACAAGGT GCTGGTCTGG GTGGGGTACG TCGACGCGCA CGGGGCGACC
GCGTCCCGAC TGGTCCGGCC GGTGTCGATC GGAGCCGGCT ACCTACGGGC GGAAGACGAA
CGGACCGAGA TGCTGCACAC GTTCGCCCTG CACCGGATCG CTGCGGCGGT TCTGGCGGAC
TGA
 
Protein sequence
MTTSLADHLR GLPDQSLAAL LQQRPDLVVP VPADVSALAI RAQSRVSVAR ALDSLDRFTL 
QVLDAARLTQ DASTGASSID AILAMTTASA RPPAATAVRA AVQQLRSRLL LYGPDERLHL
IGGIDEISPY PAGLGRPAAE LDARAAAVCD DSAKLRRTLL SAPPSARAIL DRLAAGPPVG
SVPPGALQAP PIGAEDALPP DPTNGGTPTG SPVRWLVDHR LLVSIANGGG SAPGTVELPR
EVGLLLRRDT GPLGPMPTDP PPVSAAPREP KAVDSAGAGQ AMEVVRQAEA LLEQLAVEPA
SVLRSGGLGV RDLRRLARAL GLDEPTAALL LEASHAAGLT GELEVHGAAT TRPGGEQQVV
PSAGYEVWRA GSLAQRWEQL ARAWLTMTRQ PGLIGQRDDR DRPITVLSAE AERAGAPIAR
QAALSVLADL EPASAPTPNE VLDLLDWRAP RRSRGREAAH REVLAEAATL GVTGLGALTS
YGRLLLGDTT PGEQRGVDDP LGLRGDTEPS TAVRALDGLL PAPVDHFLVQ ADLTVVVPGP
ADPTLTAELD VVAEHESAGG ASVHRVTPAS VRRALDAGYS ADDLHGLFTR RSRTPVPQGL
TYLVDDTARR HGGLRVGATG AYLRSDDEAL LGEVLADRRL EALVLRRLAP TVLVTPCQVN
RMLVTLREAG YAPVPEDATG AAVLTRPRSR RAPARVPAAT RILDPLAVPK LPMPRLLGMV
EQIRRGEAAA RAARRAPAVV RGQEAAGAVH SHTDALAVLQ QAVRDKVLVW VGYVDAHGAT
ASRLVRPVSI GAGYLRAEDE RTEMLHTFAL HRIAAAVLAD