Gene Sare_3043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3043 
Symbol 
ID5707245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3450446 
End bp3451840 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content61% 
IMG OID641272486 
Producthypothetical protein 
Protein accessionYP_001537854 
Protein GI159038601 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGGGA AGGACACCAA CACTGCGATG AATGATCAGC GTTATGACCA CGAACTGGAT 
CGGCCGGGTG CGCGCTGGGC GAGGTCACGG TGGTGGGCGG TTGGGCTGGC CGGTATGACC
GGCCTGGCTC TCACCGCCAG CCTCGGCATC GGTGGCGCGC CGGCTATCGG CGCCGTCGAC
GGCACTGTCA CCGCCGCCGA CGACCGGCCG GGAAAACCTG ACCGCGCCTC CACCGACAAC
AAGGACCACG AGGGCAAGGC AAAGCAGGGC AAACACAAGG GCACACCGGT CCCGTGTAGC
GCCGATGCGC TGATCGCGGC GATCACCCTG GGCAACGCCC GCGGCGGCGC CGTGCTCGAC
CTCGCCAAGG GCTGCACCTA CCTGCTCACC GCGAACATCG ACGACGGTGC CGGTCTGCCC
GCCATCACCG CCCCGATCAC CCTCAACGGC GGCAAACACA CCAGCATCAC CCGCGCCGCC
GCCGCCGAAC AATTCAGAAT CTTCACCGTC CAGATCGGCG GCCACCTCAC TCTCAACCAC
CTCACCATCA CCGGCGGGGA GATAAACGCC GACGGCGGAG GGGTTCTTGT CATCTCCGGC
GGAGCGCTGA CCACTAACCA CAGCACCATC ACCCGCAACG TCGCGAACGA CGGTGGTGGC
ATCGACAACT TTGGTGTCAC CACTATTAAT CACAGCATCG TCAGCCATAA CATCTCCCAG
GGATTCGGTG GCGGCGTTTC AAACTCTCAG GGAACATTAA GTATCAACAA TTCTCACATA
ACCGCTAACA CGTCCAGCGA AGGCGGCGGC GTAGTGAGTT TCGATATGGC TAGCGCCGTA
ACGATAAGGA AGAGTGTGTT CGCCGACAAT TTCTCCCGGG GAGGGAGCGG AGGCTTGGCT
GTTAGAAGCG GAATCGGTCA AATCTCCGAT ACAACCTTCA CGAACAATCG CGCGAGTAAC
TTCGCTGGTG GAGTCTACAT CGACCGGCCC GCCACTCTGC GGAACGTGGA GATCGTAAAA
AACACGGCGT TAACGCGGAT GGCCGGAGGG CTATTTGTAG ACATTAACGC GGCAGTCGTT
GTTGACAAAA GTTTGATCAA GGACAACGAC TCTATCGCCG CCATCGGTGG CGGCGTATAC
AACACAGGTC AGCTGGTGAT GCGAAAGACA ACGGTCATCG GCAACCGGGC CGACCAAGGC
GGCGGAATCT ACAACGACGC CAACGGTACG CTCCCGCTCT TTTCGACCAA GATTGTCAAG
AATGTCGCCA TCCTCGATGG AGGAGGCATC TTCAACAATG GTGGCACGGT CGAGTTGAAC
ACCGTCACTG GAACCACTGT GGTCAAGAAC CGGCCGGACA ACTGCTCCGG CGACGTACCC
GGCTGCGCCG GATAG
 
Protein sequence
MTGKDTNTAM NDQRYDHELD RPGARWARSR WWAVGLAGMT GLALTASLGI GGAPAIGAVD 
GTVTAADDRP GKPDRASTDN KDHEGKAKQG KHKGTPVPCS ADALIAAITL GNARGGAVLD
LAKGCTYLLT ANIDDGAGLP AITAPITLNG GKHTSITRAA AAEQFRIFTV QIGGHLTLNH
LTITGGEINA DGGGVLVISG GALTTNHSTI TRNVANDGGG IDNFGVTTIN HSIVSHNISQ
GFGGGVSNSQ GTLSINNSHI TANTSSEGGG VVSFDMASAV TIRKSVFADN FSRGGSGGLA
VRSGIGQISD TTFTNNRASN FAGGVYIDRP ATLRNVEIVK NTALTRMAGG LFVDINAAVV
VDKSLIKDND SIAAIGGGVY NTGQLVMRKT TVIGNRADQG GGIYNDANGT LPLFSTKIVK
NVAILDGGGI FNNGGTVELN TVTGTTVVKN RPDNCSGDVP GCAG