Gene Sare_3938 details


Gene Information

Locus tag: Sare_3938
Symbol:
ID: 5703675
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4481329
End bp: 4482522
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 72%
IMG OID: 641273363
Product: hypothetical protein
Protein accession: YP_001538719
Protein GI: 159039466
COG category:
COG ID:
TIGRFAM ID:
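
The coordinate and length fields above agree with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4481329
end_bp = 4482522

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1194 397
```

Both results match the Gene Length (1194 bp) and Protein Length (397 aa) fields.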


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.789456
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.200985
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGCTG ACCTGTACCC GGATCAGACG GCGTTGCGGG TGCCGCAGGC GGGTCCGTCC 
GACGCGGTCT CGGCGGTGGC GCGGGCGTCG GATCGGGGGG CACGGATCGG CGTCGTGGTC
ATCGCGCTGT CCTGGCACTG CACCATCGGG CTACCCGCAA CGGAGAGGAT CCGGGGCGAG
TTGACCGCGC CCAGTGTGGT GCTCGGTACC TGGCTCCTGG TCACCGTCAC CGGCGTGGTC
ACCGGTGTCC GACTGCTACG CGGTAGGCCG CTACCGGCCT GGCCACTGGC CGCGCTGCTG
CTCGTCGTGG ACGTGACGGT GTTCGCCGCG GTCGGCCGGG AGCACATGTT CAGCAGCGTC
AACTGGGTGC GGGGGACGCT GGGCTGGTTC TTCGTCCTGA TCCTGTGGGA ACGACGGATG
ACCGCGCTGC TGGGCATGCT CACTGCGCAC GCCCTGATCG CACTGGCCGC GCTGCTCGCC
TACGGGATGA CCACTGCCGC GGACCTCGCC CGCTACACGA TGCACGTCTA CGGAGTCTCG
TCGCTGCCGG TGGCGGTCGC CGCCGGCAGC GCCGCGCTCG CCACTCTGGC CCGAAAGCGC
GCCCAGGTGG CCGCCACGGC GCATGCGCTG GCAGCTGAAC GGGAGGCCGC CGAGCAGGTC
CGACAGGAGC GGCGGGACCG GTTGGGCCGG GCGGGCGAAG CCGCCCGCGA GGTGCTGGCC
GAGCTGGCCG ACGGCCGGGC CGACCCGGCC GATCCGGCCG TACAGCGCCG ATGCGTCCTG
GCGGCTGCCC GGCTGCGCCG ACTCATCGCC GAATCAGACG ACGTACCCGA CCCGCTGCTG
CACGAGTTGC GGGCCGCAGC CGACTTGGCC GAACGGAACG GGCTAGCGAT CAGCCTGGTG
ACCATCGGTA CCCCACCACC GCTGCCGGTG CGGATCCGCC GCCGGCTGGC CGACCCGCTG
ACCGCCGCGC TCGCCGAGGC GCGGGACTGG GCTCGGCTGA CCGTGGTGGC CGGCCCGGAC
GAGGTGGCCG TCAGCCTGGT CACCCCGGAT CGCCGGGAGG ACCCCATTCG GTCTGGCGAC
GACAACGGGG ACAGCGACGA AGGGGACAGC GACAGCGAAG GAGACGGTGG GGTGCAGCAC
CTCGACGAAC GGGACGGAAA GATCAGATGG ACGCAGACCC GGTGGCGGCG GTGA
 
Protein sequence
MSADLYPDQT ALRVPQAGPS DAVSAVARAS DRGARIGVVV IALSWHCTIG LPATERIRGE 
LTAPSVVLGT WLLVTVTGVV TGVRLLRGRP LPAWPLAALL LVVDVTVFAA VGREHMFSSV
NWVRGTLGWF FVLILWERRM TALLGMLTAH ALIALAALLA YGMTTAADLA RYTMHVYGVS
SLPVAVAAGS AALATLARKR AQVAATAHAL AAEREAAEQV RQERRDRLGR AGEAAREVLA
ELADGRADPA DPAVQRRCVL AAARLRRLIA ESDDVPDPLL HELRAAADLA ERNGLAISLV
TIGTPPPLPV RIRRRLADPL TAALAEARDW ARLTVVAGPD EVAVSLVTPD RREDPIRSGD
DNGDSDEGDS DSEGDGGVQH LDERDGKIRW TQTRWRR
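
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tooling. The example input is the last line of the gene sequence only.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence by removing all whitespace; return uppercase."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C (seq must be non-empty)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Last line of the gene sequence, copied from the record above.
last_line = "CTCGACGAAC GGGACGGAAA GATCAGATGG ACGCAGACCC GGTGGCGGCG GTGA"
tail = clean_sequence(last_line)

# The final three bases form the stop codon.
print(len(tail), tail[-3:])  # 54 TGA
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` would give a value to compare against the reported 72%.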