Gene Sare_2935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2935 
Symbol 
ID5705240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3324517 
End bp3325728 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content66% 
IMG OID641272384 
Producthypothetical protein 
Protein accessionYP_001537752 
Protein GI159038499 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.682348 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGGC ATCAGCACGA CGAATCGGCC TGGGTCCTAC GAGGAGGCCT GAATCCGGAT 
ACAGCGAACG TCACCAATGT TCTCACCCAC GTCAGTACCG ACGCAGCGGC AACCCTGTCC
GAGTCGATGG TGTTGGGTAT TGGTGGCGGC CTCGGCGCTG GCTACCTGTT GTGGGAGGTA
AACGGCCAGC TTCCGAGCCT CATCCTCGGA TTTCGTTTTC GCTGGAATAT GCACGACTGG
GCAGAACAGA CCCTGGGGCG GCTCGGGGCG CGCTATCAGG TGCGAACGAC CACGAGCCAG
GCCCGCGCCG CGGCGATGCT GACCGATTCG ATCGAAGCCG GCCGAGCGCC GATCATCCGG
CCCGATCGTC AGTTGCTGAG CTACTGGCAC CTGCACCCTG ACACAGACAC GACCCTGGCT
CCCCCGGTGT ACGCCGGGGC CAGGCGAGGT GCATTGGTAG CCTACGGCCT TGCCGGCGAC
GGCATCCTTG TCGACGACCG AAACCTGTCA CCCCTTACTG TCGACGCCGC GCTGCTCGCA
CAGGCCCGGG CGAAGCTGAG TTCGTCGAAG AACTACATGC TCGTCGTCGA TGCCTTCGAC
ACACCTACTG ACCTGAGCCA GATGATCAGG GCGGGAATCG CCGACTGCGT CGAGCATCTG
CACGCCTCCT CGACCGCTGT CGCCTTGCCA GCCTGGGAGA AGTGGGCAGG CCTGCTTACT
GATCGGCGTA ACGCCAAGGG TTGGCCGAAG GTGTATGCCG AAGGTCGAGG GCTTACCTCG
GCGCTGCTGG CGATCTGGAT GGGCGTCAAT CCCGCCGGCC GTATCGGCGG GGATCTTCGC
GCCTGCTACG CGGACTTCCT CGACGAGGCG GCCGCCCACC TCGGCTCGGC CGAGGCTGCC
GCGACAGCGA CCGCTGATCT CTACCGCATC GCCGCTCGGC GGTGGCAGGA GCTTGCGGAA
GCGGCTCTGC CCAGCGACGT ACCTGAGTTC GCACGGCTAC GGCGGCTCGT CACCGGCATG
TCCGATGGAG TGGTTGCCGG TGACCAAGGC GTTGACGCGC GTGGCGCGGC GGCAACCGAA
CTGTGGACCA TGCTCGCGGA GTACGACGCC GATCCACCGG TCATCGTCGA CCTCGCGACG
CTCGCCGATC GGTTGGGGGC CGTGGCCATG GCGGAGCGTT CGGCAGCAGG ATCCCTTCGT
CAGCTGGTCT AG
 
Protein sequence
MKRHQHDESA WVLRGGLNPD TANVTNVLTH VSTDAAATLS ESMVLGIGGG LGAGYLLWEV 
NGQLPSLILG FRFRWNMHDW AEQTLGRLGA RYQVRTTTSQ ARAAAMLTDS IEAGRAPIIR
PDRQLLSYWH LHPDTDTTLA PPVYAGARRG ALVAYGLAGD GILVDDRNLS PLTVDAALLA
QARAKLSSSK NYMLVVDAFD TPTDLSQMIR AGIADCVEHL HASSTAVALP AWEKWAGLLT
DRRNAKGWPK VYAEGRGLTS ALLAIWMGVN PAGRIGGDLR ACYADFLDEA AAHLGSAEAA
ATATADLYRI AARRWQELAE AALPSDVPEF ARLRRLVTGM SDGVVAGDQG VDARGAAATE
LWTMLAEYDA DPPVIVDLAT LADRLGAVAM AERSAAGSLR QLV