Gene Sare_3097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3097 
Symbol 
ID5706571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3518973 
End bp3520616 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content62% 
IMG OID641272532 
Producthypothetical protein 
Protein accessionYP_001537900 
Protein GI159038647 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0360838 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGC TATTAGCGCT CACCAAAGTT GGCGCCGGAT GCGCGCTGGT CGCAATGGCC 
GTGGTGACTG GTGTCCAGCC GGCATCGGCG GAGCCCCAGC CCGCCCCTGC CGTATCACAG
CTGGTGAACT ATGAGACGGT CGCTTGCGAC CCAACTGGGA CGACCGCCGC AGACGCCGCG
CTGGCCAGTC AACTCAATGC CGTACTCACC GCAGATATGC GTGGCTACAT GACAGCGTAC
AGAACGTCGT GTGCACGCAT GGTAGTAGCG GCGGTCAAGG CCCGAGGACT CTCACCCCGA
GCGGCGGTGA TCGCGGTCAC CACGGTTATT GTGGAAACCC ACCTCCGAAA CATTAGCGAA
GAGGTAGACC ATACAAGTCT CGGGTTGTTC CAGCAGCAGG AATGGTGGGG TTCTCGGGCG
GAGCGGCTAA ACGCAACCTG GGCTACCAAC AAGTTCATCA GCGTTATGCA TAATAAATAC
CCGGACGACT CATGGATGAC CGCCCCGATC GGCGAAGTCT GCCAGGCAGT GCAGGTGTCT
GACTTTCCGG ACCGCTACCA GGATCAGGCT GATGACGCAC AAACCATCGT GGACGCTTTA
TGGGTGCCTA CGGTCGGTGC GCCGGATGCG GTGTCGCGGG ATGGGGTGGT GGTGTCGTCG
TCGGGGCGGA TTTCGGTGTA TGCGGTGCGT GCTGATGGTG ATGTGTGGGG TCGTAGTCAG
GAATCGCCGG GTGGTTCGTT CAATGCGTGG CAGCGTTTGT CGACCGGTGG TGGTTTTGCT
GGTCAGGTAG CGGTGTTGCG GGATGATCGT GACCGGGTGG CGTTGTATGC GCGGCGGAGT
GGGACGATAT TCGGGGCGAG TCAGCAGGAA GTTGGTGGAT CGTTTGGTGT GTGGGGTCCG
ATCGGTACGA ACGGTGCGGG GGTGACGGGG GATCCGCGGG CGGTGTATGC GTCTGAGGGG
CGGATCGCTA TCTATGCGAC GACGAGTAGT GGGAATGTGT CGGGAGTGAC GCAGACGCAG
GCTGGTGGTG GGTTCGGTTC ATGGCAGCAG TTGACCAGTG GTGGTGGCTA CATGGGTAAG
CCAGCGGCGG TGGTGGATTC TCAGCAACGG GTGGCGTTGT ATGTGCGTCG GAACGGCATG
GTCTATGGGG CCAGTCAGTC GCAGGCTAAC GGTTCATTTG GGACGTGGGC TGCCCGGGGT
GTTGATGGTG CGGGTGTGGC CAGTGATCCG GTGGCGGTGT ATGGGGTCGG GGGTAGGATT
GCTATTTATG TCACCAGCAC TGCGGGGAAC GTTGCTGGGG TCAATCAGGT AGCCGCTGGT
GGTGAGTTCG GTGCTTGGCA GGTGTTGACC AGCACGGGTG GGTATGAGGG CCGGCCGGCG
GTGTTGGTTG ACGAGCAGGG TCGGGTAGCG GTCTACGTGC GTCGAAGTGG CGCGATCTAC
GGCGCTAGTC AGCCCGAGGC CGGTGGTCCG TTCGGTGCCT GGGCTGCTCG TGGCACCGGT
AGTCCCCAAC TCATCGGTGA TCCCACTGCT GTGTATGGCG TTGGTGACCG AATCGCCCTG
TATGCCGCCG CTACCAACGA CAGTATCGGC GGTGTTAGCC AGGGCGAAGC CGGCGGCACC
TTCGGCAACT GGATCGTCCT TTGA
 
Protein sequence
MKKLLALTKV GAGCALVAMA VVTGVQPASA EPQPAPAVSQ LVNYETVACD PTGTTAADAA 
LASQLNAVLT ADMRGYMTAY RTSCARMVVA AVKARGLSPR AAVIAVTTVI VETHLRNISE
EVDHTSLGLF QQQEWWGSRA ERLNATWATN KFISVMHNKY PDDSWMTAPI GEVCQAVQVS
DFPDRYQDQA DDAQTIVDAL WVPTVGAPDA VSRDGVVVSS SGRISVYAVR ADGDVWGRSQ
ESPGGSFNAW QRLSTGGGFA GQVAVLRDDR DRVALYARRS GTIFGASQQE VGGSFGVWGP
IGTNGAGVTG DPRAVYASEG RIAIYATTSS GNVSGVTQTQ AGGGFGSWQQ LTSGGGYMGK
PAAVVDSQQR VALYVRRNGM VYGASQSQAN GSFGTWAARG VDGAGVASDP VAVYGVGGRI
AIYVTSTAGN VAGVNQVAAG GEFGAWQVLT STGGYEGRPA VLVDEQGRVA VYVRRSGAIY
GASQPEAGGP FGAWAARGTG SPQLIGDPTA VYGVGDRIAL YAAATNDSIG GVSQGEAGGT
FGNWIVL