Gene Sare_0540 details


Gene Information

Locus tag: Sare_0540
Symbol:
ID: 5705640
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 611016
End bp: 612245
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 68%
IMG OID: 641270066
Product: hypothetical protein
Protein accession: YP_001535460
Protein GI: 159036207
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3483] Tryptophan 2,3-dioxygenase (vermilion)
TIGRFAM ID:
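
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 611016
end_bp = 612245

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the final codon is the stop codon
# and contributes no amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1230 0 409
```

Under these assumptions the result agrees with the listed Gene Length (1230 bp) and Protein Length (409 aa).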


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00210127
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGTGCTTCG TGCGCGAGTT GAACTGTTGG TTGTCTGGCA CCACTGATCC TGCCGATTTT 
CCGTACCTCG CGGTACTGCG CGAATTTCAC GAAGTCGGCA AGCACTTCGT CGAGAAGGAG
ACACTCTCAC TGCTCGACGA GAGTCGCGGC AGGGTGACGG GTCACCCTGC CGCAGGCCAC
GACGACCCGG CGCGGTTGTT GCGCGACTTT CTCGACGTCG CGCTCGACAA ATGGGATGGC
CGCTACGACT ACCGCAGCTA CCTGGCACTG CGCCTGATCG GGCTGTCCGG CGAGGCGGAG
GAACCCACAT TCGGCGGGGA CGACGCCGCC AGGCGTCTGC TCCGTGACCG CCTTGTGGTC
TGCCTGGTCG CCGACGTCCT GAATTTCGAA CTGGCTGCTG CCGCGCACGC CACCACTCTG
CTTCCGCGGC AGCGCCCGGG GCTGACCGTG GTGGCCAAAC GCTGCCGGCT CGGGGTCCGG
GCCGCCCTTC CCGCGCTGGC CCGGCTCGGC CTGACCGGGC TGGTGCAGGA GGGCGAGCCG
ACGTCCGCCG CCGCCACGCT GCACGCCGTC GCCGTCGACC TCGACTCCGT CGGTGCCCTG
CCGCTGCGAC TGAGCATGCT GCCGGTCCAC GTGACCCACG ACGAGTACCT GTTCCTCCGG
GTACTTCAGG CGTACGAGTG CGTCTTCGCC GGTGTCGCCG ACGAACTGCG TGCCGTCATC
GCCGCGATCG GCGTCGACGA CGCTCGTGCG GCAGCCGACC GGTTGGAATA CGCCCGGAAC
CTGATCCTCA ACGCCGGTCC ACTCTTCTCA TTGCTGGCCA CCATGCAGCC GAAGTCGTTT
CAGACATTCC GGCAGTACAC CGAAGGAGCC AGCGCCATCC AGTCACGGTC GTACAAGCTC
GTCGAGTCGT TGTGCCGTGG GCCGGATCAG GACCGGCTCG ACTCGGCCGC GTATGCGGCG
GTGCCGGAAC TGCGGGCCCT GGTCCGGGCT GGTCAGCCGA CGATCGACGA CGCGTACCGG
TCGGCCGTGC GGGATGGGCG ACTCGTCGGC GCGGACCGCG ACCTGATCAC CCGACGGATG
GAGTTGTTCG CCGAGACGCT GCTTCAGTGG CGACGCACCC ACCACCGGAT CGCGGTCCGG
ATGCTCGGTC CCCGACCCGG CACCGGCTAC ACGGAGGGCA CACCGTACCT GGCGGCGGTC
CGTGCCCTGC CGGTCTTCTT CACCGCCTGA
 
Protein sequence
MCFVRELNCW LSGTTDPADF PYLAVLREFH EVGKHFVEKE TLSLLDESRG RVTGHPAAGH 
DDPARLLRDF LDVALDKWDG RYDYRSYLAL RLIGLSGEAE EPTFGGDDAA RRLLRDRLVV
CLVADVLNFE LAAAAHATTL LPRQRPGLTV VAKRCRLGVR AALPALARLG LTGLVQEGEP
TSAAATLHAV AVDLDSVGAL PLRLSMLPVH VTHDEYLFLR VLQAYECVFA GVADELRAVI
AAIGVDDARA AADRLEYARN LILNAGPLFS LLATMQPKSF QTFRQYTEGA SAIQSRSYKL
VESLCRGPDQ DRLDSAAYAA VPELRALVRA GQPTIDDAYR SAVRDGRLVG ADRDLITRRM
ELFAETLLQW RRTHHRIAVR MLGPRPGTGY TEGTPYLAAV RALPVFFTA
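
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the database's own pipeline. It assumes that translation table 11 uses the standard amino acid assignments and that an alternative initiation codon such as GTG is read as methionine at the first position. `translate_cds`, `START_CODONS`, and the other names are hypothetical helpers, and only the first and last lines of the gene sequence are used.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons allowed by translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str, is_start: bool = True) -> str:
    """Translate a DNA fragment in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # Assumption: a valid initiation codon at position 0 is read as Met.
        if i == 0 and is_start and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence from the record.
first_line = "GTGTGCTTCG TGCGCGAGTT GAACTGTTGG TTGTCTGGCA CCACTGATCC TGCCGATTTT"
last_line = "CGTGCCCTGC CGGTCTTCTT CACCGCCTGA"

print(translate_cds(first_line))                 # MCFVRELNCWLSGTTDPADF
print(translate_cds(last_line, is_start=False))  # RALPVFFTA
```

The two outputs correspond to the first 20 and last 9 residues of the protein sequence listed above. The gene begins with GTG rather than ATG, which is why the start-codon handling is needed to reproduce the leading M.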